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NOVEL PLEXINS AND USES THEREOF 
BACKGROUND OF THE INVENTION 

Field of the Invention 

5 The invention relates to the identification and characterization of four novel 

proteins that are members of the plexin family. 
Summary of the Related Art 

Plexin family members encode large transmembrane proteins, whose cysteine- 
rich extracellular domains share regions of homology with the Scatter Factor receptors 

10 (encoded by the Met gene family). The extracellular domains of plexins also contain 
-500 amino acid Semaphorin domains (see below). The highly conserved cytoplasmic 
moieties of plexins (approx. 600 amino acids), however, have no homology with the 
Met tyrosine kinase domain, nor with any other known protein. Met-like receptors and 
their ligands, the Scatter Factors, mediate a complex biological program including 

15 dissociation of cell-cell contacts, motility and invasion (for a review see Tamagnone, L. 
and Comoglio, P.M. (1997) "Control of invasive growth by hepatocyte growth factor 
(HGF) and related scatter factors." Cytokine Growth Factor Rev 8, 129-142). During 
embryogenesis Scatter Factor- 1 and Met promote the dissociation of cell layers in the 
somites and drive the migration of myogenic cells to their appropriate location (Bladt, 

20 F., Riethmacher, D., Isenmann, S., Aguzzi, A., and Birchmeier, C. (1995) "Essential 
role for the c-met receptor in the migration of myogenic precursor cells into the limb 
bud." Nature 376, 768-771; Maina, F., Casagranda, F., Audero, E., Simeone, A., 
Comoglio, P.M., Klein, R.a., and Ponzetto, C. (1996) "Uncoupling of grb2 from the 
met receptor in vivo reveals complex roles in muscle development." Cell 87, 531-542). 

25 Met and Scatter Factor- 1 are also involved in controlling neurite outgrowth and axonal 
guidance (Ebens, A., Brose, K., Leonardo, E.D., Hanson, M.G.J., Bladt, F., Birchmeier, 
C, Barres, B.A., and Tessier-Lavigne, M. (1996) "Hepatocyte growth factor/scatter 
factor is an axonal chemoattractant and a neurotrophic factor for spinal motor neurons." 
Neuron 17, 1157-1172; Maina, F., Hilton, M.C., Ponzetto, C, Davies, A.M., and 

30 Klein, R. (1997) "Met receptor signaling is required for sensory nerve development and 
HGF promotes axonal growth and survival of sensory neurons." Genes Dev 11, 3341- 
3350; Maina, F., Hilton, M.C., Andres, R., Wyatt, S., Klein, R., and Davies, A.M. 
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(1998) "Multiple roles for hepatocyte growth factor in sympathetic neuron 
development." Neuron 20, 835-846). 

The first clue regarding a possible function for plexins came from the finding 
that a novel plexin, Vespr, interacts with the viral semaphorin A39R (Comeau, M.R., 

5 Johnson, R., DuBose, R.F., Petersen, M., Gearing, P., VandenBos, T., Park, L., Farrah, 
T., Buller, R.M., Cohen, J.I., Strockbine, L.D., Rauch, C, and Spriggs, M.K. (1998) "A 
poxvirus-encoded semaphorin induces cytokine production from monocytes and binds 
to a novel cellular semaphorin receptor, VESPR." Immunity. 8, 473-482). Semaphorins 
are a large family of secreted and membrane-bound molecules that are characterized by 

10 an extracellular domain containing a -500 amino acid Semaphorin domain (Kolodkin et 
al. (1993) "The semaphorin genes encode a family of transmembrane and secreted 
growth cone guidance molecules." Cell 75, 1389-1399). As noted above, plexins 
contain a more divergent but nevertheless conserved Semaphorin domain. 

Semaphorins were originally characterized in the nervous system, where they 

15 have been implicated in repulsive axon guidance (Kolodkin et al. (1993) supra; Luo, 
Y., Raible, D., and Raper, J. A. (1993) "Collapsin: a protein in brain that induces the 
collapse and paralysis of neuronal growth cones." Cell 75, 217-227; Tessier-Lavigne, 
M. and Goodman, C.S. (1996) "The molecular biology of axon guidance." Science 
274, 1123-1133). More recently, semaphorins have been furthermore implicated in 

20 cardiac and skeletal development (Behar, O., Golden, J.A., Mashimo, H., Schoen, F.J., 
and Fishman, M.C. (1996) "Semaphorin HJ is needed for normal patterning and growth 
of nerves, bones and heart." Nature 383, 525-528), in the immune response (Hall, K.T., 
Boumsell, L., Schultze, J.L., Boussiotis, V.A., Dorfman, D.M., Cardoso, A.A., 
Bensussan, A., Nadler, L.M., and Freeman, G.J. (1996) "Human CD100, a novel 

25 leukocyte semaphorin that promotes B-cell aggregation and differentiation." 

Proc.Natl.Acad.Sci.U.S.A. 93, 11780-11785), in the regulation of angiogenesis (Miao, 
H.Q., Soker, S., Feiner, L., Alonso, J.L., Raper, J.A., and Klagsbrun, M. (1999) 
"Neuropilin-1 mediates collapsin- 1/Semaphorin ITJ inhibition of endothelial cell 
motility. Functional competition of collapsin-1 and vascular endothelial growth factor- 

30 165" [In Process Citation], J Cell Biol 146, 233-242), and in tumor growth and 
metastasis (Christensen, C.R., Klingelhofer, J., Tarabykina, S., Hulgaard, E.F., 
Kramerov, D., and Lukanidin, E. (1998) "Transcription of a novel mouse semaphorin 
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gene, M-semaH, correlates with the metastatic ability of mouse tumor cell lines." 
Cancer Res. 58, 1238-1244). 

Previously identified plexins have been shown to be expressed in the developing 
nervous system, (i.e. Plexin-A is a receptor for class 1 semaphorins (Sema-la and 
5 Sema-lb). Moreover, Plexin-A has been shown via genetic analysis to control motor 
and CNS axon guidance induced by semaphorins (Winberg, M.L., Noordermeer, J.N., 
Tamagnone, L., Comoglio, P.M., Spriggs, M.K., Tessier-Lavigne, M., and Goodman, 
C.S. (1998). Plexin A is a neuronal semaphorin receptor that controls axon guidance. 
Cell 95, 903-916). 

10 Thus a need exists for discovery of other members of the plexin family of 

proteins. 

SUMMARY OF THE INVENTION 

The present invention provides four novel plexin family members. 

In one aspect, the invention provides an isolated nucleic acid having at least 
15 80% nucleic acid sequence identity to a nucleotide sequence that encodes an amino acid 
sequence selected from the group consisting of the amino acid sequence shown in (SEQ 
ID NO: 2), (SEQ ID NO: 4), (SEQ ID NO: 6) and (SEQ ID NO: 8). 

In other aspects, the invention provides isolated nucleic acid having at least 
80% nucleic acid sequence identity to a nucleotide sequence selected from the group 
20 consisting of the nucleotide sequence shown in (SEQ ID NO: 1), (SEQ ID NO: 3), 
(SEQ ID NO: 5) and (SEQ ID NO: 7). 

In another aspect, the invention provides a vector comprising the nucleic 
acid of the above-aspects. 

The invention also provides an isolated polypeptide the amino acid 
25 sequence of which comprises residues 1-18, 19-518, 451-530, 601-680, 751-830, 
800-1010, or 1196-1215 of SEQ ID NO: 2; 1-23, 24-507 or 1100-1119 of SEQ ID 
NO: 4; or 1-42, 43-600, 541-620, 691-770, 831-910, 900-1110 or 1270-1293 of SEQ 
ID NO: 6; or 8-49, 154-199 or 1-199 of SEQ ID NO: 8. 

In another aspect, the invention provides an isolated polypeptide having at 
30 least 80% amino acid sequence identity to an amino acid sequence selected from the 
group consisting of the amino acid sequence shown in (SEQ ID NO: 2), (SEQ ID NO: 
4), (SEQ ID NO: 6) and (SEQ ID NO: 8) 
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The invention also provides a chimeric molecule comprising a polypeptide of 
the above aspects fused to a heterologous amino acid sequence. In one embodiment 
the heterologous amino acid sequence is a Fc region of an immunoglobulin. 
In other aspects, the invention provides an antibody that specifically 
5 binds to the polypeptides of the above aspects. 

The invention also provides a method of treating, suppressing or altering a 
disorder involving aberrant immune regulation involving a signaling pathway between a 
plexin and a neuropilin in a mammal comprising the step of administering an effective 
amount of an agent to said mammal capable of interfering with the association between 
10 the plexin and neuropilin. Contemplated agents include a chimeric molecule or an 
antibody of the above aspects. 

DESCRIPTION OF THE DRAWINGS 

Figure 1. 

(A) Phylogenetic tree of human plexins. Known family members cluster in two 

15 major groups: plexin A and plexin-B subfamilies. (B) Structural features of plexins, 
Met-like receptors and semaphorins. In the extracellular moieties, yellow boxes indicate 
the "sema" domains and blue boxes mark the cysteine-rich MRS motifs, some of which 
are stippled to indicate their atypical sequence; atypical MRS are also found in the sema 
domain of semaphorins. Sequence identity among sema domains ranges from 15-50%, 

20 as previously described (see Winberg et al., 1998 supra). Potential furin-like 

proteolytic sites are marked by red ribbons. plexin-B 1 "truncated" is the product of a 
splicing variant (see text). plexin-Dl and plexin-Cl (VESPR) are more distant family 
members, since they include atypical features in their extracellular domains. The 
intracellular domain of plexins (SP domain) is highly conserved through all family 

25 members, and it includes two separate regions of high homology (Maestrini, E., 

Tamagnone, T., Longati, P., Cremona, O., Gulisano, M., Bione, S., Tamanini, F., Neel, 
B.G., Toniolo, D., and Comoglio, P.M. (1996) "A family of transmembrane proteins 
with homology to the MET-hepatocyte growth factor receptor." Proc. Natl. Acad. Sci. 
USA 93, 674-678) (green oval and box). Met-like receptors are disulfide-bound 

30 heterodimers and include a cytoplasmic tyrosine kinase domain (red box). Mammalian 
semaphorins can be either secreted or cell surface proteins. Molecular weights of 
representative proteins are Plexin-Al 220 kDa, Plexin-B 1 250 kDa, Plexin-Cl 200 
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kDa, HGF-R/Met 145+45 kDa (heterodimer), Sema4D 150 kDa (transmembrane), 
Sema7A approx. 100 kDa (membrane GPI-linked). 
Figure 2. 

(a) Cell surface semaphorins specifically bind human plexins. Micropraphs of 

5 the binding assays done testing i) the extracellular domain of semaphorin CD 100 fused 
to alkaline phosphatase (CD100-AP) on COS cells transfected with plexin-Bl cDNA; 
ii) control AP on plexin-B 1 ; iii) CD100-AP on plexin-B2; iv) CD100-AP on the entire 
extracellular domain of plexin-Bl; v) CD100-AP on isolated "plexin-Bl truncated" 
(including sema domain, 1° and 2° MRS); vi) CD100-AP on a "plexin-B 1-Asema" 

10 (including 2° and 3° MRS; vii) extracellular domain of semaphorin A39R fused to AP, 
on plexin-Cl (Vespr); viii) SemaKl-AP on plexin-Cl. The final detection of the 
binding was done either using alkaline phosphatase substrates (i-vi) or by 
immunofluorescence (vii and viii). (B) Scatchard analysis and binding curve of 
CD100-AP to plexin-Bl (K D = 0.9 nM ± 0.15). 

15 Figure 3. 

Plexins associate with neuropilins via specific extracellular domains. Western 
blots of immunoprecipitated samples from cells co-expressing neuropilins and plexins. 
Specific MoAbs were used, directed against the VSV-tag included in plexins or the 
myc-tag included in neuropilin-2 (Np2, 130 kDa). Np2 co-immunoprecipitates with 

20 plexins, such as plexin-A3 (220 kDa), the extracellular domain of plexinAl 

(approx. 160 kDa), and plexin-Bl (250 kDa) but not with the unrelated cell surface 
receptor DCC (170 kDa). Np2 can also associate a truncated form of the extracellular 
moiety of plexin-Bl ("plex-Bl trunc", approx. 1 10 kDa), containing the sema domain. 
Figure 4. 

25 Expression of mRNAs for plexins Al (panel A, B), -A2 (panel C, D) and A3 

(panel E, F) in the spinal cord (sc), dorsal root ganglia (d) and sympathetic ganglia (sg) 
of E13.5 mouse embryos. Expression of the mRNAs was detected by RNA in situ 
hybridization. Scale bar:l fim. 
Figure 5. 

30 Effect of a truncated plexin-Al construct (lacking the cytoplasmic domain) on 

repulsive and attractive responses of Xenopus spinal neurons to Sema3A and netrin-1. 
(A-F) A control spinal neuron exposed to a gradient of Sema3A emanating from a 
pipette (A) is repelled away over a period of 1 hr (B). In contrast, a GFP-expressing 
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spinal neuron from an embryo injected with mRNA for the truncated plexin-Al 
construct (C) is not affected by Sema3A (D). A similar neuron (E) shows a normal 
attractive response to netrin-1 (F). 

(G) Cumulative distribution plot of turning angles for all the neurons studied. 
5 Curves show the percent of neurons with turning angles less than the angle indicated on 
the abscissa, under different conditions (open circles, control neurons; black and blue 
circles, control neurons responding to Sema3A or netrin-1, respectively; red and green 
circles, responses of neurons expressing the truncated plexin-Al construct to Sema3A 
and netrin-1, respectively. (H) Mean turning angle under all the conditions just 
10 mentioned. 

Figure 6. 

Tyrosine phosphorylation of plexin-A3 and plexin-Bl. (a) Anti- 
phosphotyrosine western blotting of immunoprecipitated p220 plex ~ A3 and p250 plex " B1 
proteins. plexin-B 1 is larger since it contains an extra sequence between the second and 

15 the third MRS motif, in the extracellular domain (see Fig. 1). (b) The same 
immunoprecipitated samples underwent in vitro kinase assay in the presence of 
[y 32 P]ATP , Mg ++ and Mn ++ ions. The SDS-PAGE was treated with alkali in conditions 
known to eliminate the phosphate labeling of Ser/Thr residues and specifically 
preserving phosphotyrosines. 

20 Figure 7 

Plexin-A3 overexpression mediates cell repelling cues, (a) Epithelial kidney 
MDCK cells transfected to overexpress plexin-A3 (or mock transfected) were 
cocultured with mesenchymal KJ-29 or NTH-3T3 cells. After 16-30 hours, mixed 
cultures of control cells (upper panels) reached confluency and stopped growing: 

25 typically the epithelial cells formed islets (circled) surrounded by a fibroblasts lawn. In 
contrast, MDCKs overexpressing plexin-A3 (lower panels) overwhelmed the adjacent 
mesenchymal cells. The latter withdrew and selectively detached from the culture dish 
(dying cell clusters are indicated by arrowheads), and eventually epithelial cells only 
survived. To allow an easier detection of the mesenchymal cells, these were labeled 

30 with Dil before being plated in mixed cultures, (b) Plexin-A3 expressing cells do not 
induce apoptotic signal on repelled fibroblasts. Mixed cultures of NIH 3T3 and control 
or plexin-A3 overexpressing MDCKs were tested for the presence of TUNEL-AP 
positive cells. Apoptotic cells were not present in clusters of repelled cells (indicated by 
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arrows). The right panel shows a positive control where apoptosis was induced on the 
same cells by UV treatment, (c) Plexin-A3 over-expressing cells form very transient 
contacts with fibroblasts. Time-lapse video-microscopy of control and plexin-A3 
overexpressing MDCK cells grown in presence of fibroblasts. On top, snap-shot 

5 images from the movie, taken every 50 minutes (real time). In the upper row is shown 
the persistent contact of a fibroblast (marked by an arrow) with an islet of control 
MDCK cells (marked by a star). In the lower row another fibroblast, instead, forms a 
transient contact with an islet of plexin-A3 transfected cells, which also, in turn, 
reshapes. At the bottom, the diagrams show the relative frequency of persistent, 

10 transient or very transient contacts between fibroblasts and MDCK cells. 

DETAILED DESCRIPTION OF THE INVENTION 
The reference works, patents, patent applications, and scientific literature, 
including accession numbers to GenBank database sequences, that are referred to herein 
establish the knowledge of those with skill in the art and are hereby incorporated by 

15 reference in their entirety to the same extent as if each was specifically and individually 
indicated to be incorporated by reference. Any conflict between any reference cited 
herein and the specific teachings of this specification shall be resolved in favor of the 
later. Likewise, any conflict between an art-understood definition of a word or phrase 
and a definition of the word or phrase as specifically taught in this specification shall be 

20 resolved in favor of the latter. 

Four novel human plexins have been identified: plexin-B2, plexin-B3, plexin-Dl 
and Plexin A-4. Plexin-A4 is located on human chromosome 7 and is a family member 
of the plexin-A subfamily which also includes plexin-Al (alternatively named plexin- 
1/NOV), plexin-A2 (alternatively named plexin-2/OCT) and plexin-A3 (alternatively 

25 named plexin-2/SEX). Plexin-B2 and plexin-B3 are located on human chromosome 22 
and chromosome X, respectively, and are family members of the plexin-B subfamily 
which also includes plexin-Bl (alternatively named SEP). Plexin-B3 maps very close 
to the plexin-A3 genomic locus on Xq28. Plexin-Dl is the first identified member of 
the plexin-D subfamily and is atypical of any of the other subfamilies. A fourth 

30 subfamily of plexins, the plexin-C subfamily, is defined by VESPR (now plexin-Cl). 

The four novel plexins as described herein have in their extracellular domains 
regions of homology with two other protein families: (a) Scatter Factors Receptors, 
encoded by the MET oncogene family (Tamagnone and Comoglio, 1997 supra), and (b) 
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Semaphorins (Kolodkin et al. (1993) supra (Figure lb). In particular, plexins and Met- 
like receptors contain short cysteine-rich motifs, termed "Met Related Sequences" 
(MRS), whose minimal consensus is: C-X(5-6)-C-X(2)-C-X(6-8)-C-X(2)-C-X(3-5)-C 
(Maestrini et al., 1996 supra); Tamagnone and Comoglio, 1997 supra); blue boxes in 
5 Fig. IB). The proteins of the Met family contain a single MRS (in their receptor (3 
chains), whereas in plexin family members there are two/three repeated MRS motifs. 
Furthermore, all plexin family members contain in their extracellular moiety a 500 
amino acid region similar to the sema domain of semaphorins (Kolodkin et al. (1993) 
supra; Winberg et al, 1998 supra); yellow boxes in Fig. IB. The MRS motif is 

10 proposed to function as protein-protein interaction domain. 

The cytoplasmic domain of plexins contains a -600 amino acid domain which we 
term the SP domain ("Sex and Plexins", marked in green in Fig. IB) that is highly 
conserved within the family (57-97% similarity) and in evolution (over 50% similarity 
between invertebrates and humans). The SP domain does not share homology with any 

15 known protein. It includes a number of potential tyrosine phosphorylation sites, but 
lacks the typical motifs of catalytic tyrosine kinases. Interestingly, the predicted 
secondary structure of the SP domain includes long conserved alpha helices, typically 
found in protein -protein interaction modules. Furthermore, there are several 
dihydrophobic amino acid motifs (such as LL or LI), known to mediate the 

20 internalization and downregulation of transmembrane receptors (Sandoval, IV. and 
Bakke O. (1994). Targeting of membrane proteins to endosomes and lysosomes. 
Trends in Cell Biology 4, 292-297). 

The present invention also demonstrates that plexins can form complexes with 
neuropilins, which in turn demonstrates that a receptor for semaphorins can be hetero- 

25 oligomers of plexins and neuropilins. As demonstrated by in situ mRNA expression 
analysis, plexins and neuropilins are in fact simultaneously expressed in several 
neuronal populations during embryonic development. The plexin-neuropilin complex 
predates ligand binding, since the association is not influenced by the presence of class 
3 semaphorins. That the observed plexin-neuropilin complexes are formed in cis is 

30 furthermore supported by the experimental conditions used (cotransfection of isolated 
cells with the two constructs). An interaction in trans might also be envisioned 
(considering that plexins and semaphorins share similar sema domains), however by 
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analyzing mixed cultures of cells separately transfected with plexins and neuropilins we 
did not isolate associated complexes (data not shown). 

We observed that the main semaphorin binding domain of neuropilins (CUB 
domain (Giger, R.J., Urquhart, E.R., Gillespie, S.K., Levengood, D.V., Ginty, D.D., and 
5 Kolodkin, A.L. (1998) "Neuropilin-2 is a receptor for semaphorin IV: insight into the 
structural basis of receptor function and specificity." Neuron 21, 1079-1092; 
Nakamura, R, Tanaka, M., Takahashi, T., Kalb, R.G., and Strittmatter, S.M. (1998) 
"Neuropilin-1 extracellular domains mediate semaphorin D/HI-induced growth cone 
collapse" [In Process Citation]. Neuron 21, 1093-1100; Chen et al. 1998 supra) is not 

10 required for the interaction with plexins, as indicated by the association of the relevant 
Neuropilin-2 deletion construct with plexin-Bl (not shown). A ternary complex, where 
neuropilins use two distinct protein modules to form a bridge between the sema domain 
of semaphorins and the sema domain of plexins is thus contemplated. It is further 
contemplated that plexins are the functional partners of neuropilins, required for 

15 transducing signals mediated by semaphorins, preferably class 3 semaphorins. For 
example, in flies, which lack both neuropilins and class 3 semaphorins, D Plex A 
appears sufficient as a functional receptor for Sema la, a transmembrane class 1 
semaphorin (Winberg et al., 1998 supra). Further support that plexins are functional co- 
receptors for secreted semaphorins is demonstrated in an experiment that shows that a 

20 truncated plexin-Al construct expressed in Xenopus spinal neurons abolishes repulsive 
responses to Sema3A without markedly affecting attractive responses to netrin-1. These 
results are consistent with the involvement of plexins. 

The intracellular signals transduced by plexins are still largely obscure. The 
cytoplasmic domain of plexins is large and highly conserved within and across species 

25 and contains stretches of alpha helices, which are putative protein-protein interaction 
domains, and could thus mediate the association with cytosolic partners. We 
demonstrate herein that the cytoplasmic domain of plexins can be tyrosine 
phosphorylated, suggesting that, like other receptors devoid of intrinsic catalytic 
activity, plexins may signal by associating a tyrosine kinase (Stahl, N. and 

30 Yancopoulos, G.D. (1993). The alphas, betas, and kinases of cytokine receptor 

complexes. Cell 74, 587-590; Glass, D.J., Bowen, D.C., Stitt, T.N., Radziejewski, C, 
Bruno, J., Ryan, T.E., Gies, D.R., Shah, S., Mattsson, K., Burden, S.J., DiStefano, P.S., 
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Valenzuela, D.M., DeChiara, T.M., and Yancopoulos, G.D. (1996). Agrin acts via a 
MuSK receptor complex. Cell 85, 513-523) . 

In addition, we show herein that expression of plexins, particularly plexin-A3, 
mediates cell-repelling cues. By time-lapse video-microscopy we observed a true 
5 repelling effect on fibroblasts. Intriguingly, we observed that -upon interaction with 
fibroblasts- also the islets of plexin-A3 MDCKs at times reshaped. This may be 
explained by the existence of intra-epithelial repelling cues, balanced by the attractive 
forces exerted by epithelial cell junctions. 

Moreover we have demonstrated that in the nervous system (i.e. Drosophila), that 

10 defasciculating motor axons co-express both Plexin A and one of its interacting 

partners, the transmembrane scmaphorin Sema-la (Winberg et ah, 1998 supra). This 
demonstrates that plexins act in vivo either as receptors or ligands for cell surface 
semaphorins, which in turn can transduce intracellular signals, as reported for ephrins 
(Holland et al., 1996 supra). Semaphorins, therefore, besides being pivotal in axon 

15 guidance, have a general role in morphogenesis and tissue remodeling by mediating 
cell-repelling cues via their interactions with plexins. 

Accordingly, in a first aspect, the invention provides an isolated nucleic acid 
molecule encoding a novel human plexin polypeptide. By "plexin polypeptide" is meant 
a member of the plexin family comprising an amino acid sequence that shares at least 

20 60% amino acid sequence homology with SEQ ID NOS: 2 (plexin B-2), 4 (plexin B-3), 6 
(plexin D-l) or 8 (plexin A-4), preferably, at least 65% sequence homology, more 
preferably, at least 70% sequence homology, more preferably, at least at least 75% 
sequence homology, more preferably, at least 80% sequence homology, still more 
preferably at least 85% sequence homology, even more preferably, at least 90% sequence 

25 homology, and most preferably at least 95% sequence homology with SEQ ID NOS: 2, 4, 
6 or 8. Plexin polypeptides of the invention are useful for modulating cell growth (i.e. 
nerve) and immune regulation. 

As used herein, by "modulating" is meant increasing or decreasing cell growth. 
By "cell growth" is meant any change in cell number or size, including, without 

30 limitation, increase or decrease in cell number, increase or decrease in rate of cell 

division, increase or decrease in rate of cell death, and/or increase or decrease in cell size. 
Standard methods for measuring cell growth include standard apoptosis assays {e.g., 
TUNEL assays, DNA fragmentation, trypan blue exclusion) and cell proliferation assays 
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(e.g., 3 H-thymidine incorporation). It will be appreciated that the degree of modulation 
of cell growth provided by a plexin polypeptide in a given assay will vary, but one of 
skill in the art can readily determine the statistically significant change in cell growth of a 
cell exposed to a plexin polypeptide. 
5 By "immune regulation" is meant increasing or decreasing the biological 

functions of immune cells (i.e., cells involved in an immune response). Immune cells 
include, without limitation, lymphocytes (T and B), NK cells, dendritic cells, myeloid 
cells (e.g., macrophages and neutrophils), and other hematopoietic cells involved in an 
immune response. 

10 By "nucleic acid molecule" or "nucleic acid" as used herein, is meant any 

deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), including, without limitation, 
complementary DNA (cDNA), genomic DNA, RNA, heteronuclear RNA (hnRNA), 
messenger RNA (mRNA), DNA/RNA hybrids, or synthetic nucleic acids (e.g., an 
oligonucleotide) comprising ribonucleic and/or deoxyribonucleic acids or synthetic 

15 variants thereof. The nucleic acid of the invention includes, without limitation, an 
oligonucleotide or a polynucleotide. The nucleic acid can be single stranded, or 
partially or completely double stranded (duplex). Duplex nucleic acids can be 
homoduplex or heteroduplex. 

By "polypeptide" is meant any molecule comprising two or more amino acids 

20 joined together with a peptide bone, regardless of length or post-translational 
modifications (e.g., without limitation, glycosylation, lipidation, acetylation, or 
phosphorylation). Useful plexin polypeptides of the invention include, without 
limitation, the full length plexin polypeptides having the amino acid sequence of SEQ 
ID NOS: 2, 4 , 6, 8 or 10,; an extracellular domain of the polypeptide having the amino 

25 acid sequence 1 to 1 199 of SEQ ID NO: 2; 1 to 1099 of SEQ ID NO: 4; 1 to 1270 of 
SEQ ID NO: 6 or 1 to 199 of SEQ ID NO: 8, with its associated signal peptide; or an 
extracellular domain of the polypeptide having the amino acid sequence 19 to 1 199 of 
SEQ ID NO: 2; 24 to 1099 of SEQ ID NO: 4; or 43 to 1270 of SEQ ID NO: 6, lacking 
its associated signal peptide; an intracellular domain of the polypeptide having the 

30 amino acid sequence of SEQ ID NOS: 2, 4, 6 or 8; and polypeptides, the amino acid 
sequence of which comprises about residues 1-18 (putative signal sequence), 19- 
518 (sema domain), 451-530 (1° MRS), 601-680 (2° MRS), 751-830 (3° MRS), 800- 
1010 (G-P repeats) or 1196-1215 (putative transmembrane domain) of SEQ ID 
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NO: 2; about residues 1-23 (putative signal sequence), 24-507 518 (sema 
domain) or 1100-1119 (putative transmembrane domain) of SEQ ID NO: 4; or 
about residues 1-42 (putative signal sequence), 43-600 (sema domain), 541-620 
(1° MRS), 691-770 (2° MRS), 831-910 (3° MRS), 900-1110 (G-P repeats) or 1270- 

5 1293 (putative transmembrane domain) of SEQ ID NO: 6; or about residue 8-49 
(1° MRS) or 154-199 (2° MRS) of SEQ ID NO: 8. 

By "isolated" is meant a compound (e.g., a nucleic acid molecule or a protein) 
that has been separated from components (e.g., nucleic acid molecules, proteins, lipids, 
and/or carbohydrates) which naturally accompany it. Water, buffers, and other small 

10 molecules (e.g. , molecules having a molecular weight of less than about 1000 daltons) 
may accompany an isolated compound of the invention. Preferably, an isolated 
compound is at least 70%, by weight, free from components which naturally 
accompany it. More preferably, an isolated is at least 75%, by weight, free from 
components which naturally accompany it; still more preferably, at least 80%, by 

15 weight, free; even more preferably, at least 85%, by weight, free; and even more 

preferably, at least 90%, by weight, free from components which naturally accompany 
it. Most preferably, a substantially purified compound is at least 95%, by weight, free 
from components which naturally accompany it. 

Where the isolated compound is a nucleic acid molecule, the isolated nucleic acid 

20 molecule is separated from other nucleic acids (e.g., genes or transcripts) or proteins 
which, in the naturally-occurring genome of the organism from which the nucleic acid 
molecule was derived, flanked the nucleic acid molecule. Isolated nucleic acid 
molecules therefore include, without limitation, a recombinant nucleic acid molecule 
incorporated into a plasmid or other vector (e.g., a replication-defective virus); a 

25 recombinant nucleic acid molecule incorporated into the genome of a prokaryotic or 
eukaryotic organism; or a nucleic acid molecule which exists as a separate molecule 
independent of other nucleic acids (e.g., a PCR product, a chemically synthesized nucleic 
acid molecule, or a nucleic acid molecule produced by restriction endonuclease 
digestion). Purification of a nucleic acid molecule can be accomplished and measured by 

30 any standard method including, without limitation, sequence analysis, chemical 

synthesis, PCR, CsCl gradient, phenol :chloroform extraction, ethanol precipitation, 
Southern or Northern blotting analysis followed by band extraction and purification, and 
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high performance liquid chromatography (HPLC; see, e.g., Fisher (1980) Laboratory 
Techniques in Biochemistry and Molecular Biology , Work and Burdon (eds.), Elsevier). 

Thus, in one non-limiting example, to obtain an isolated nucleic acid molecule 
encoding a plexin polypeptide, a nucleic acid molecule is chemically synthesized on a 

5 standard oligonucleotide synthesis machine. The resulting single stranded molecule is 
then subjected to second strand synthesis to form a duplex DNA molecule, which is then 
ligated into a plasmid capable of replication in a prokaryotic or eukaryotic cell. The 
nucleic acid molecule is then replicated in the cell, purified (e.g., by CsCl gradient), and 
subjected to sequence analysis. 

10 In certain embodiments of the first aspect of the invention, the nucleic acid 

molecule has a nucleic acid sequence comprising SEQ ID NOS: 1, 3, 5, 7 or 9. 
Preferably, the nucleic acid molecule of the invention has not more than 500 nucleotides 
flanking each of the 5' and 3' ends of SEQ ID NOS: 1, 3, 5, 7 or 7. In certain 
embodiments, the plexin polypeptide has an amino acid sequence that comprises SEQ ID 

15 NOS: 2, 4, 6, 8 or 10. Preferably, the plexin polypeptide of the invention has not more 
than 50 amino acid residues flanking each of the N-terminal and C-terminal ends of SEQ 
ID NOS: 2, 4, 6, 8 or 10. 

In certain embodiments of the first aspect of the invention, the nucleic acid 
molecule hybridizes under stringent conditions (as defined herein) to SEQ ID NOS: 1,3, 

20 5, 7 or 9. 

The invention also includes nucleic acid molecules that hybridize under 
stringent hybridization conditions (as defined herein) to all or a portion of the 
nucleotide sequence represented by SEQ ID NOS: 1, 3, 5, 7 or 9 or its complement. The 
hybridizing portion of the hybridizing nucleic is at least 80%, e.g., at least 95%, or at 
25 least 98%, homologous to the sequence of a portion or all of a nucleic acid encoding a 
polypeptide having the amino acid sequence of SEQ ID NOS: 2, 4, 6, 8 or 10, or its 
complement. Hybridizing nucleic acids of the type described herein can be used, for 
example , as a cloning probe, a primer (e.g., a PCR primer) or a diagnostic probe. 

Hybridization of the oligonucleotide probe to a nucleic acid sample typically is 
30 performed under stringent conditions. Nucleic acid duplex or hybrid stability is 

expressed as the melting temperature or Tm, which is the temperature at which a probe 
dissociates from a target DNA. This melting temperature is used to define the required 
stringency conditions. If sequences are to be identified that are related and substantially 
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identical, rather than identical, then it is useful to first establish the lowest temperature 
at which only homologous hybridization occurs with a particular concentration of salt 
(e.g., SSC or SSPE). Then, assuming that 1% mismatch results in a 1°C decrease in the 
Tm, the temperature of the final wash in the hybridization reaction is reduced 
5 accordingly (for example, if the sequences have > 95% identity with the probe are 

sought, the final wash temperature is decreased 5°C). In practice, the change in the Tm 
can be between 0.5 C and 1.5 C per 1% mismatch. "Stringent conditions" involve 
hybridization at 68°C in 5x SSC/5x Denhardt's solution/1.0% SDS, and washing in 
0.2xSSC/0.1% SDS at room temperature. "Moderately stringent conditions" include 

10 washing in 3xSSC at 42°C. The parameters of salt concentration and temperature can 
be varied to achieve the optimal level of identity between the probe and the target 
nucleic acid. Additional guidance regarding such conditions is readily available in the 
art, for example, by Sambrook et al, supra; and Ausubel et al, supra. 

Nucleic acid sequence homology (as well as amino acid sequence homology) can 

15 be measured according to standard methods. Unless otherwise specified, as used herein 
used herein, "percent homology" of two amino acid sequences or of two nucleic acids is 
determined using the algorithm of Karlin and Altshul (Proc. Natl. Acad. Sci. USA 87: 
2264-2268, 1990), modified as in Karlin and Altschul (Proc. Natl. Acad. Sci. USA 90: 
5873-5877, 1993). Such an algorithm is incorporated into the NBLAST and XBLAST 

20 programs of Altschul et al. (J. Mol. Biol. 215: 403-410, 1990). BLAST nucleotide 

searches are performed with the NBLAST program, e (score) = 100, word length = 12, to 
obtain nucleotide sequences homologous to a nucleic acid molecule of the invention. 
BLAST protein searches are performed with the XBLAST program, e (score) = 50, word 
length = 3, to obtain amino acid sequences homologous to a reference polypeptide (e.g., 

25 SEQ ID NO: 2). To obtain gapped alignments for comparison purposes, Gapped BLAST 
is utilized as described in Altschul et al. (Nucleic Acids Res. 25: 3389-3402, 1997). 
When utilizing BLAST and Gapped BLAST programs, the default parameters of the 
respective programs (e.g., XBLAST and NBLAST) are used, namely e=10; w=l 1 for 
nucleic acid; w=3 for amino acid (the Blosum 62 scoring matrix); low complexity 

30 sequence filtering. The default settings of BLAST emphasize regions of local alignment 
to detect relationships among sequences which share only isolated regions of similarity 
(Altschul et al., J. Mol. Biol. 215: 403-410 (1990). See http://www.ncbi.nlm.nih.gov . 
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Thus, in a non-limiting example to obtain an isolated nucleic acid molecule 
encoding a plexin polypeptide, a nucleic acid molecule having the sequence of SEQ ID 
NOS: 1, 3, 5 or 7 is used to probe a cDNA library under stringent conditions according to 
standard techniques (see., e.g., Ausubel et ah, supra). Upon identification of a positive 
5 clone (i.e., a clone that hybridizes to SEQ ID NOS: 1, 3, 5 or 7 under stringent 

conditions), that clone is expanded and subjected to sequence analysis. A nucleic acid 
molecule having a nucleic acid sequence that is at least 70% identical, preferably at least 
75% identical, more preferably, at least 80% identical, still more preferably at least 85% 
identical, even more preferably, at least 90% identical, and most preferably at least 95% 
10 identical (as measured by the basic BLAST program of NCBI on default settings) to SEQ 
ID NOS: 1, 3, 5 or 7 is a nucleic acid molecule of the invention. 

In a second aspect, the invention provides four novel isolated plexin polypeptides. 
"Isolated" is as defined for the first aspect of the invention. Where the isolated 
compound is a polypeptide, the isolated polypeptide is separated from organic molecules, 
15 such as nucleic acid molecules, polypeptides, and/or carbohydrates, which, in the 

naturally-occurring organism from which the polypeptide was derived, accompany the 
polypeptide. Isolated polypeptides therefore also include a recombinant polypeptide 
(e.g., a human polypeptide expressed in an insect cell), or a chemically synthesized 
polypeptide. Purification of a polypeptide can be accomplished and measured by any 
20 standard method including, without limitation, chemical synthesis, recombinant 
polypeptide expression in prokaryotic or eukaryotic cells, affinity chromatography, 
Western blotting analysis, SDS-PAGE analysis, and/or HPLC. 

In accordance with this aspect, the invention provides all derivatives, mutants, 
truncations, and/or splice variants of the four novel plexin polypeptides, so long as these 
25 derivatives, mutants, truncations, and/or splice variants share at least 60% amino acid 
sequence homology with SEQ ID NOS: 2 ,4 ,6 or 8, preferably, at least 65% sequence 
homology, more preferably, at least 70% sequence homology, more preferably, at least at 
least 75% sequence homology, more preferably, at least 80% sequence homology, still 
more preferably at least 85% sequence homology, even more preferably, at least 90% 
30 sequence homology, and most preferably at least 95% sequence homology with SEQ ID 
NOS: 2 ,4 ,6 or 8 as determined using the basic BLAST program of the National Center 
for Biotechnology (NCBI; National Library of Medicine, Bethesda, MD), using the 
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default settings defined therein using the sequence of the four novel plexin derivative, 

mutant, truncation and/or splice variance as the probe. 

Preferred plexin polypeptide derivatives include polypeptides whose sequences 

differ from the sequence given in SEQ ID NOS: 2 ,4 ,6 or 8, by one or more 
5 conservative amino acid substitutions, or by one or more non-conservative amino acid 

substitutions, deletions or insertions which do not abolish the biological activity of the 

plexins. Conservative amino acid substitutions typically include the substitution of one 

amino acid for another with similar biochemical characteristics, such as polarity, size, 

and/or charge. Non-limiting examples of conservative substitutions are substitutions 
10 within the following groups: valine, glycine, glycine, alanine, valine, isoleucine, 

leucine; aspartic acid, glutamic acid, asparagine, glutamine; serine, threonine; lysine, 

arginine, phenylalanine, and tyrosine. 

Useful methods for mutagenesis to generate plexin mutants are known in the art 

(see, e.g., Sambrook et ah, supra; Ausubel et al, supra). Preferred methods for 
15 mutagenesis are described in PCT Publication W099/1 2965 and include PCR 

mutagenesis and saturation mutagenesis. A library of random amino acid sequence 

variants can also be generated by the synthesis of a set of degenerate oligonucleotide 

sequences. 

In certain embodiments of the second aspect of the invention, the plexin 
20 polypeptide has a sequence comprising the sequence of SEQ ID NOS: 2 ,4 ,6 or 8. In 
one non-limiting example, in accordance with the invention, an isolated plexin 
polypeptide comprising the sequence of SEQ ID NOS: 2 ,4 ,6 or 8 can chemically 
synthesized according to standard techniques (e.g., at a commercial peptide generating 
facility). 

25 For example, a putative plexin polypeptide is purified and subjected to N- 

terminal sequencing to determine its amino acid sequence. The amino acid sequence of 
the polypeptide is then compared to SEQ ID NOS: 2 ,4 ,6, 8 or 10 (as measured by the 
basic BLAST program of NCBI on default settings). A polypeptide that shares at least 
60% homology with SEQ ID NOS: 2 ,4 ,6, 8 or 10 is a plexin polypeptide of the 

30 invention. 

In another example, purification of a plexin polypeptide is facilitated by the 
addition of a tag to the polypeptide that enables purification of the tagged polypeptide. 
Non-limiting examples of a tag include a hemagglutinin (HA) tag, a his tag, a GST tag, a 
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FLAG-tag, and a myc tag. Thus, a nucleic acid molecule of the first aspect is engineered 
using standard molecular biology techniques to incorporate the nucleic acid sequence 
encoding the tag. The engineered nucleic acid molecule is then introduced and 
positioned for expression in an appropriate cell to produce the recombinant tagged 
5 polypeptide, which can then be readily purified by binding of the tag to its substrate. For 
example, the his tag binds to Ni-NTA agarose. Likewise, a GST (glutathione S- 
transferase) tag binds to glutathione agarose beads. Both his tag and GST tag expression 
and purification kits are commercially available from PharMingen (San Diego, CA). 
Likewise, myc-tagged plexin polypeptide are produced by cells introduced with a nucleic 

10 acid molecule encoding the tagged protein and positioned for expression in the cell. 

It will be appreciated that particularly useful polypeptides of this aspect of the 
invention are secreted by the cell in which they are produced, thus facilitating 
purification of the polypeptide from the culture media in which the cells have been 
maintained, without requiring lysis of the cell. 

15 In a third aspect, the invention provides a cell engineered to comprise a nucleic 

acid molecule encoding one of the four plexin polypeptides. By "engineered" is meant 
that the cell of the invention has been modified by standard molecular biology 
techniques. Where the cell is "engineered to comprise a nucleic acid molecule," standard 
molecular biology techniques have been employed to introduce the indicated nucleic acid 

20 molecule into the cell, either by transformation or transfection of the cell with a plasmid, 
or by infection or transduction of the cell with a recombinant virus. 

The nucleic acid molecule of the first aspect of the invention is preferably 
subcloned into a plasmid or vector (for example, but not limited to, a vector used to 
generate a recombinant virus), wherein the nucleic acid molecule is positioned for 

25 expression in the plasmid or vector. The plasmid or vector is then introduced into a cell 
by standard techniques to produce an engineered cell in accordance with the third aspect 
of the invention. 

In certain embodiments of the third aspect, the cell is a prokaryotic cell (e.g., a 
bacterium). For example, E. coli cells (e.g., DH5a) are transformed (using, e.g., 
30 electroporation) with a bacterial plasmid (i.e., a plasmid containing an E. coli origin of 
replication) containing a nucleic acid molecule of the first aspect of the invention. The 
transformed bacteria are selected using, for example, an antibiotic-resistance encoding 
nucleic acid molecule (e.g., ampicillin resistance) on the plasmid such that the antibiotic 
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resistance protein is expressed by the transformed bacteria. The transformed bacteria are 
then propagated, and can be cryopreserved and stored frozen in glycerol. 

Those of skill in the art will appreciate that in accordance with the third aspect of 
the invention, a nucleic acid molecule encoding one of the four plexin polypeptides may 
5 be introduced into a large variety of cells. For example, a nucleic acid molecule 
encoding one of the four plexin polypeptides can be introduced into prokayotic cells 
(e.g., bacteria), and any eukaryotic cell into which an exogenous nucleic acid molecule 
may be introduced. Thus, in certain embodiments of the third aspect of the invention, the 
cell is a eukaryotic cell. Eukaryotic cells according to this aspect of the invention that 
10 comprise a nucleic acid molecule encoding one of the four plexin polypeptides include, 
without limitation, yeast cells, plant cells, insect cells, and mammalian cells. Within the 
category of mammalian cells are cells from any mammalian species (including, without 
limitation, mouse, hamster, monkey, human), of any lineage (including, without 
limitation, lymphocyte, fibroblast, stem cell), and may be an immortalized cell, or a non- 
15 immortalized cell. Cells, as well as plasmids and/or vectors (e.g., vectors that can be 
packaged to form infectious virus particles), are commercially available, for example, 
from the American Type Culture Collection ("ATCC"; Manassas, VA). 

In certain embodiments of the third aspect of the invention, the nucleic acid 
molecule is positioned for expression in the cell. By "positioned for expression" is 
20 meant that the nucleic acid molecule is operably linked to at least one regulatory 

sequence which directs the transcription and translation of the nucleic acid molecule in a 
cell, such that the cell engineered to comprise the nucleic acid molecule produces (i.e., 
expresses) the protein encoded by the nucleic acid molecule. By "operably linked" is 
meant that the nucleic acid molecule and the regulatory sequence are connected in a such 
25 a way as to permit expression of the nucleic acid molecule when the nucleic acid 
molecule is present in a cell. Regulatory sequences include, without limitation, 
promoters, enhancers, IRES sequences, and polyadenylation signals. Since plexin 
polypeptides are involved in immune regulation and the modulation of cell growth, it 
may be desirable to operably link a nucleic acid molecule encoding one of the four plexin 
30 polypeptides to an inducible promoter. 

The four plexin polypeptides that are encoded by the nucleic acid molecules do 
not necessarily include the transmembrane domain of the four plexin polypeptides, and 
so may be produced by the cell as an intracellular polypeptide or a soluble secreted 
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polypeptide. For example, if the polypeptide fragment is secreted by the cell, it can be 
purified from the conditioned growth media of the transfected cells, without having to 
lyse the cells. Likewise, although a soluble intracellular polypeptide fragment is purified 
from only lysed cells, the fragment, being soluble, does not have to be extracted from the 

5 cell membrane; thus, different lysis conditions may be used to obtain purified soluble 
intracellular polypeptide fragment as compared to the lysis conditions required to obtain 
purified full length plexin polypeptides (which has a transmembrane domain). 

Protein expression systems have been established for a variety of cells and are 
known to those of skill in the art. Cells are also commercially available from the ATCC, 

10 and a variety of protein expression system kits are commercially available from, for 
example, Invitrogen Corp. (Carlsbad, CA), Clontech Laboratories (Palo Alto, CA), 
PharMingen (San Diego, CA), Promega Corp. (Madison, WI), and Stratagene (La Jolla, 
CA). 

For example, a nucleic acid molecule encoding one of the four plexin 

15 polypeptides is operably linked to bacterial regulatory sequences (e.g., T7 late promoter 
or bacteriophage regulatory sequences), and the resulting nucleic acid molecule is used to 
transform bacterial cells, where the transformed bacterial cells produce one of the four 
plexin polypeptides. In another example, a nucleic acid molecule encoding one of the 
four plexin polypeptides is operably linked to baculovirus regulatory sequences in a 

20 baculovirus vector. Recombinant baculovirus are then generated and used to transduce 
insect cells (using, for example, the expression kit commercially available from Clontech 
Laboratories. The transduced insect cells comprise a nucleic acid molecule encoding one 
of the four plexin polypeptides positioned for expression in the insect cell, and thus 
produce one of the four plexin polypeptides. 

25 Mammalian cells are widely used as protein expression systems. For example, a 

mammalian cell may be transduced with a recombinant retrovirus or adenovirus 
comprising a nucleic acid molecule encoding one of the four plexin polypeptides 
operably linked to regulatory sequences that are either endogenous to the particular virus 
or exogenous to the virus (e.g., a CMV promoter in a retroviral vector). The transduced 

30 mammalian cell is then propagated in vitro in tissue culture, in vivo (e.g., in an 
immunocompromised animal), and/or cryopreserved and stored frozen in DMSO. 

In another example, mammalian cells are transfected with an expression plasmid 
comprising a nucleic acid molecule encoding one of the four plexin polypeptides 
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operably linked to one or more regulatory sequences on the plasmid. By "expression 
plasmid" is meant a plasmid in which an inserted nucleic acid molecule of interest (e.g., 
encoding one of the four plexin polypeptides, a plexin chimeric molecule, or tagged 
plexin polypeptide) is operably linked to at least one regulatory sequence such that when 

5 the expression plasmid containing the inserted nucleic acid molecule of interest is 
introduced (e.g., by transfection) into a cell, the inserted nucleic acid molecule is 
positioned for expression in that cell. The nucleic acid molecule in the expression 
plasmid, upon being introduced into the cell, is thus positioned for expression in that cell, 
and enables the cell to produce one of the four plexin polypeptides encoded by the 

10 nucleic acid molecule. 

In one non-limiting example, a nucleic acid molecule according to the first aspect 
of the invention is inserted into a standard mammalian expression plasmid (e.g., 
pcDNA3.1 commercially available from Invitrogen Corp., Carlsbad, California), such 
that the inserted nucleic acid molecule encoding one of the four plexin polypeptides is 

15 operably linked to the regulatory sequences in the mammalian expression plasmid. 

Mammalian cells are then transfected with this expression plasmid (using, e.g., CaP0 4 or 
DEAE-dextran). Where the expression plasmid contains an antibiotic-resistance 
encoding nucleic acid molecule (e.g., neomycin resistance on the pCDNA3.1 plasmid) 
such that the antibiotic resistance protein is expressed by the transfected cells, transfected 

20 cells may be selected for the ability to grow in the presence of the antibiotic. The 
transfected cells may then be propagated and cryopreserved and stored in frozen in 
DMSO. 

In a fourth aspect, the invention provides an isolated nucleic acid molecule 
encoding a chimeric molecule comprising at least two segments, wherein one of the 

25 segments comprises one of the four plexin polypeptides. By "chimeric molecule" is 
meant a protein that comprises at least two segments of polypeptide joined together by 
any means, including, without limitation, a covalent bond (e.g., peptide bond), a non- 
covalent bond (e.g., ionic bond or hydrogen bond) or by a chemical crosslinker. It should 
be noted that one of the four plexin polypeptides that has been tagged is within the 

30 definition of a chimeric molecule. 

In certain embodiments of the fourth aspect of the invention, the nucleic acid 
molecule encoding the segment of a chimeric molecule comprising one of the four plexin 
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polypeptides hybridizes under stringent conditions to SEQ ID NO: 1, 3, 5 or 7. 
"Stringent conditions" are as described above for the first aspect of the invention. 

Standard molecular biology techniques may be employed to generate nucleic acid 
molecules encoding chimeric molecules according to the fourth aspect of the invention. 

5 For example, a nucleic acid molecule encoding the extracellular domain of one of the 

four plexin polypeptides may be joined, in frame, to a nucleic acid molecule encoding the 
constant region of an immunoglobulin molecule (see, e.g., Chamow S.M., Antibody 
Fusion Proteins, John Wiley & Sons, New York, 1999). By "in frame" is meant that a 
first nucleic acid molecule is ligated to a second nucleic acid molecule such that the each 

10 of the amino acid sequences of the polypeptides encoded by each of the first and the 
second nucleic acid molecules is not frame-shifted. 

In one non-limiting example, a chimeric molecule comprising the extracellular 
domain of one of the four plexin polypeptides including the amino acid sequence of SEQ 
ID NOS: 2, 4, 6 or 8 is generated. In this example, a nucleic acid molecule encodes 

15 amino acid residue number 1(19) through about amino acid residue number 1 199 of SEQ 
ID NO: 2; amino acid residue number 1(24) through about amino acid residue number 
1099 of SEQ ID NO: 4; amino acid residue number 1(43) through about amino acid 
residue number 1270 of SEQ ED NO: 6 and amino acid residue number 1 through about 
amino acid residue number 199 of SEQ ID NO: 8 with its associated signal peptide 

20 (parenthesis depicts about the beginning of the amino acid sequence of the extracellular 
domain lacking its signal peptide). This nucleic acid molecule is fused in frame with a 
nucleic acid molecule encoding the constant region of an immunoglobulin, such that the 
chimeric molecule encoded by the resulting nucleic acid molecule generally has the 
following structure: 

25 



N-terminus 


extracellular domain of SEQ ID 
NO: 2 with or lacking its signal 
peptide 


amino acids from the constant 
region of an Ig molecule 
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N-terminus 


extracellular domain of SEQ ID 
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peptide 
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region of an Ig molecule 


C-terminu; 


N-terminus 


extracellular domain of SEQ ID 
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region of an Ig molecule 
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I 1 peptide I 1 

The heavy chain class {e.g., IgG, IgA, IgM, IgD, or IgE) can be varied in this chimeric 
molecule depending upon which constant region is used. Nucleic acid molecules 
encoding the constant region of various immunoglobulin (Ig) heavy chains are known 

5 (see, e.g. , Zettlmeissl et al., DNA Cell Biol. 9(5):347-53, 1990) Indeed, expression 
plasmids are available, into which the nucleic acid molecule of interest (i.e., a nucleic 
acid molecule encoding an extracellular domain of the polypeptide of SEQ ID NO: 2; 
SEQ ID NO: 4; SEQ ID NO: 6; or SEQ ID NO: 8) can be inserted, and the resulting 
plasmid introduced into a cell to produce one of the four extracellular plexin-Ig chimeric 

10 molecule s (see, e.g., Zettlmeissl et al., supra; Miller et al., J. Exp. Med. 178 (1): 211- 
222, 1993). 

Any variety of chimeric molecule carrying the extracellular domains of one of the 
four plexin polypeptide may be generated. For example, the extracellular domain of one 
of the four plexin polypeptides can be myc-tagged, his-tagged, or FLAG tagged 

15 according to standard molecular biology techniques. 

Such extracellular proteins are particularly useful for identifying ligands to which 
the extracellular domain of one of the four plexin polypeptides bind. For example, 
extracellular plexin-Dl-Ig chimera can be immobilized on a protein A-sepharose column. 
Molecules suspected of binding the extracellular domain of plexin-Dl are added to the 

20 column, to which the molecule that binds to the extracellular domain of plexin-Dl 
adhere, and the non-binding molecules flow through the column. The extracellular 
plexin-Dl -binding molecules are readily eluted, for example, by changing the pH or ion 
concentration of the elution buffer. 

Extracellular plexin proteins are also used to identify cells expressing the ligand 

25 of plexin extracellular domain on their cell surface (and thereby also identify the ligand 
itself). For example, cells are incubated with a FLAG-tagged plexin extracellular domain 
chimeric molecule. A FLAG-tagged plexin extracellular domain chimeric molecule is 
generated. An anti-FLAG antibody that is detectably labeled is then added to the cells. 
By "detectably labeled" is meant any means for marking and identifying the presence of a 

30 molecule. Detectable labels include, without limitation, radioactive labels (e.g., 32 P or 
35 S) and fluorophore labels (e.g., FI1C, phycoerythrin, or rhodamine). For example, 
FtTC-labeled anti-FLAG antibodies are commercially available from Babco, Richmond, 
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CA. The "stained" cells (i.e., cells incubated first with the FLAG-tagged plexin 
extracellular domain chimeric molecule and then with the FITC-labeled anti-FLAG 
antibody), are then subjected to flow cytometry analysis to select those cells that are 
labeled with FITC, and so express a molecule that binds to the extracellular domain of 
5 one of the four plexin polypeptides. The FTTC labeled cells are then further manipulated 
(e.g., characterized to determine which cells express the plexin polypeptide ligand). 

The ligand of the plexin extracellular domain is itself identified, for example, by 
lysing the cells, adding the lysate to one of the four plexin extracellular domain-Ig 
chimeric molecule columns described above, and purifying the ligand. The ligand is then 

10 sequenced by N-terminal sequencing. 

In another non-limiting example, the intracellular domain of one of the four 
plexin polypeptides is used as "bait" in a yeast two-hybrid system to identify ligands that 
interact with the intracellular domain of one of the four plexins described herein. For 
general description of the two-hybrid system, see U.S. Patent Nos. 5,283,173; 5,468,614; 

15 and 5,695,941. In this example, a nucleic acid molecule encoding from about amino acid 
residue number 143 through at least amino acid residue number 214 of SEQ ID NO: 2 is 
inserted into a standard DNA binding domain expression plasmid (e.g., the GAL4 DNA 
binding domain plasmid in the Interactor kit commercially available from PharMingen 
(San Diego, CA). (It will be understood that the nucleic molecule may encode amino 

20 acid residue number 138-148 through at least amino acid residue number 214 of SEQ ED 
NO: 2.) A variety of cDNA libraries in transcriptional activation domain vectors are 
available (e.g., from Clontech, Palo Alto CA). The cDNA libraries are screened 
employing standard methods (see, e.g., the methods employed in U.S. Patent No. 
5,780,262) to identify cDNA clones encoding a ligand that binds to the intracellular 

25 domain of one of the four plexin polypeptides. One preferable cDNA library screened in 
this example is a cDNA library generated from an immune cell (e.g., a lymphocyte or NK 
cell). 

In a fifth aspect, the invention provides a purified chimeric molecule comprising 
one of the four plexin polypeptides. Methods for purifying proteins are as described for 
30 the second aspect of the invention. 

In a sixth aspect, the invention provides a cell engineered to comprise a nucleic 
acid molecule encoding a chimeric molecule comprising at least two segments, wherein 
one of the segments comprises one of the four plexin polypeptides. As described for the 
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third aspect of the invention, a nucleic acid encoding a chimeric molecule comprising 
one of the four plexin polypeptides may be introduced into any variety of cells. In certain 
embodiments, the cell is a prokaryotic cell or a eukaryotic cell. In certain embodiments, 
the eukaryotic cell is a yeast cell or a mammalian cell (e.g., a human cell). 
5 In a seventh aspect, the invention provides an isolated binding agent that 

specifically binds one of the four plexin polypeptides, or specifically binds a chimeric 
molecule comprising a segment comprising one of the four plexin polypeptides. In 
certain embodiments, the plexin protein has an amino acid sequence comprising SEQ ID 
NOS:2, 4, 6 or 8. 

10 By "specifically binds" is meant a binding agent (e.g., an antibody) that binds to 

its specific target (e.g., one of the four plexin polypeptides) with greater affinity than it 
binds to other molecules. Preferably, where the binding agent is an antibody, the 
antibody preferably specifically binds to its specific target with a dissociation constant 
(K D) of at least 10" 5 M, more preferably, 10" 6 M, even more preferably 10" 7 M, and most 

15 preferably, the binding agent specifically binds to its specific target with a K D of at least 
10- 8 M. 

Preferably, the binding agent of this aspect of the invention is an antibody, such 
as a monoclonal antibody or a polyclonal antibody, or a fragment of an antibody that 
specifically binds one of the four plexin polypeptides. Standard methods may be 

20 employed to generated both monoclonal and polyclonal antibodies that specifically bind 
to one of the four plexin polypeptides of the invention. See, e.g., Ausubel et al., supra; 
Coligan, J.E. et al., Current Protocols in Immunology , John Wiley & Sons, New York 
(1991); and Delves, P.J., Antibody Production: Essential Techniques , John Wiley & 
Sons, New York (1997). Briefly, the plexin polypeptides of the present invention, 

25 purified according to the methods described for the second aspect of the invention, are 
used to immunize rabbits (e.g., for polyclonal antibodies) or mice (e.g., for monoclonal 
antibodies) to generate antibody-mediated immunity to the four plexin polypeptides used 
to immunize the animal. For monoclonal antibodies, antibodies can be screened by, e.g., 
ELISA, to identify those antibodies that show the highest affinity for the immunizing 

30 plexin protein of polypeptide fragment. The cloned cell producing the high affinity 
monoclonal antibody can then propagated in vitro (where the antibody is purified from 
the culture supernatant) or in vivo (where the antibody is purified from ascites fluid), and 
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can also be cryopreserved and stored frozen at, e.g., -70°C in DMSO, to provide a 
potentially limitless supply of monoclonal antibody. 

In addition to intact monoclonal and polyclonal antibodies, the invention also 
provides various antibody fragments, such as Fab, F(ab') 2 , Fv, and sFv fragments. 

5 Recombinant, chimeric, and humanized antibodies are also provided. 

Recombinant "humanized antibodies" which specifically bind to one of the four 
plexin polypeptides can be synthesized according to methods known in the art (see, e.g., 
Green L.L. et ah, Nature Genetics 7: 13-21, 1994 for fully humanized antibodies 
expressed in transgenic animals; see also U.S. Patent Nos: 5,693,761; 5,777,085; and 

10 5,585,089). Humanized antibodies are chimeras comprising mostly human IgG 

sequences into which at least portions of the regions responsible for specific antigen- 
binding {e.g., CDR's) have been inserted. Animals are immunized with the desired 
antigen, the corresponding antibodies are isolated, and the portion of the variable region 
sequences responsible for specific antigen binding are removed. The animal-derived 

15 antigen binding regions are then cloned into the appropriate position of human antibody 
genes in which the antigen binding regions have been deleted. Humanized antibodies 
minimize the use of heterologous (i.e., inter-species) sequences in human antibodies, 
and thus are less likely to elicit immune responses in the treated subject (see also, e.g., 
U.S. Patent No. 5,807,715). 

20 Construction of different classes of recombinant antibodies can also be 

accomplished by making chimeric or humanized antibodies comprising nonhuman 
variable domains and human constant domains (CHI, CH2, CH3) isolated from 
different classes of immunoglobulins. For example, antibodies with increased antigen 
binding site valencies can be recombinantly produced by cloning the antigen binding 

25 site into vectors carrying the human chain constant regions (see, e.g., Arulanandam et 
al.,J. Exp. Med. 177: 1439-1450, 1993). 

In addition, standard recombinant DNA techniques can be used to alter the 
binding affinities of recombinant antibodies with their antigens by altering amino acid 
residues in the vicinity of the antigen binding sites. The antigen binding affinity of a 

30 humanized antibody can be increased by mutagenesis based on molecular modeling 
(see, e.g., Queen et al., Proc. Natl. Acad. Sci. 86: 10029-10033, 1989). 

Also provided in the invention are plexin polypeptide-specific single 
polypeptide chain antibodies (see general methods in U.S. Patent Nos. 4,946,788 and 
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4,704,692); single domain antibodies (Ward, E.S. et al., Nature 341: 544-546, 1989); 
and chimeric antibodies (U.S. Patent No. 4,816,567). 

Binding agents that specifically bind the plexin polypeptides of the present 
invention are useful, for example, in determining expression levels of the plexin 
5 polypeptides in various tissues of the body, Western blotting analysis, and 

immunochromatography. Particularly, binding agents that specifically bind the plexin 
polypeptides are useful for binding the plexin polypeptide on a cell expressing the 
plexin polypeptide, thereby activating the cell. 

A binding agent that specifically binds one of the four plexin polypeptides, for 
10 example, is effective as an immune modulator. Additional applications include, without 
limitation, an injectable formulation comprising a binding agent that specifically binds 
one of the four plexin polypeptides that is useful to antagonize activity in a disease 
involving aberrant immune regulation or a disease involving aberrant cell growth. 

In an eighth aspect, the invention provides an isolated antisense oligonucleotide 
15 complementary to a portion of a nucleic acid molecule encoding one of the four plexin 
polypeptides. In certain embodiments, hybridization of the antisense oligonucleotide to 
the nucleic acid molecule inhibits transcription or translation of the nucleic acid 
molecule. 

By two nucleic acid molecules being "complementary" to one another is meant 
20 that the first nucleic acid molecule {e.g., an oligonucleotide) is able to form Watson- 
Crick base pair hydrogen bonds (i.e., hybridize) with the second nucleic acid molecule to 
form a duplex. The first nucleic acid molecule is thus a "complement" of the second 
nucleic acid molecule. 

The antisense oligonucleotides according to the invention are complementary to a 
25 region of a nucleic acid molecule (or a region at the intron/exon boundary of DNA or 
RNA) that encodes one of the four plexin polypeptides. Preparation of antisense 
oligonucleotides is well known (see, e.g., Agrawal et ah, Trends Biotechnol. 10:152-158, 
1992; U.S. Patent No. 5,149,798; Agrawal et al, Proc. Natl. Acad. Sci. USA 85:7079- 
7083, 1988; Froehler, Tetrahedron Lett. 27:5575-5578, 1986; and Bergot et al., J. 
30 Chromatog. 559:35-42, 1992. 

In a ninth aspect, the invention provides a method for identifying a nucleic acid 
molecule encoding one of the four plexin polypeptides, comprising contacting a pool of 
candidate nucleic acid molecules with a nucleic acid molecule encoding one of the four 
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plexin polypeptides, wherein hybridization of the nucleic acid molecule encoding one of 
the four plexin polypeptides under stringent conditions to a candidate nucleic acid 
molecule identifies the candidate nucleic acid molecule as a nucleic acid molecule that 
encodes one of the four plexin polypeptides. According to this aspect of the invention, 

5 "hybridization" and "stringent conditions" are as defined above for the first aspect of the 
invention. In certain embodiments, the nucleic acid molecule encoding one of the four 
plexin polypeptides has a nucleic acid sequence comprising SEQ ID NOS: 1, 3, 5 or 7. 

It will be understood that the isolated plexin polypeptides according to the second 
aspect of the invention, the plexin chimeric molecules according to the fifth aspect of the 

10 invention, and binding agents that specifically bind the plexin polypeptides according to 
the seventh aspect of the invention, are useful as therapeutics to treat an individual 
suffering from, or suspected of having, a disease or disorder involving aberrant immune 
regulation or an individual suffering from, or suspected of having, a disease or disorder 
involving aberrant cell growth, particularly nerve cell growth. 

15 By "disease or disorder involving aberrant immune regulation" is meant any 

disease or disorder in which an abnormal immune response is generated in response to 
either self or foreign antigens. Thus, this definition includes, without limitation, 
autoimmune diseases (e.g., lupus, inflammatory bowel disease, or Diabetes Type 1) and 
immunosuppressive diseases (e.g., multiple sclerosis or rheumatoid arthritis). 

20 By "disease or disorder involving aberrant cell growth" is meant any disease or 

disorder in which an abnormal amount of cell growth is observed. "Cell growth" is 
defined above. Thus, diseases and disorders involving aberrant cell growth include 
hyperplasia, neoplasia, and cancer, as well as degenerative diseases, such as 
neurodegenerative diseases. 

25 Preferable therapeutically useful plexin polypeptides are soluble polypeptides 

(e.g., lacking the hydrophobic transmembrane domain of the plexin polypeptides), 
particularly soluble polypeptide fragments that are secreted by the cell in which the 
fragment was produced. In a preferred embodiment the soluble plexin polypeptides are 
selected from the group consisting of plexin-A-1 (Maestrini et al. 1996 supra), plexin-A- 

30 2 (Maestrini et al. 1996 supra), plexin-A-3 (Maestrini et al. 1996 supra), plexin-A-4, 
plexin-B-1 (Maestrini et al. 1996, supra), plexin-B-2, plexin-B-3, plexin-Cl (Comeau et 
al. 1998 supra), plexin-D-1. 
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In a tenth aspect, the invention provides a method for diagnosing a disease 
involving aberrant immune regulation or a disease involving aberrant cell growth, 
comprising comparing the amino acid sequence of one of the four plexin polypeptides 
from an individual suspected of having the disease with the amino acid sequence of one 
5 of the four plexin polypeptides from an unaffected individual, wherein the presence of a 
difference between the two amino acid sequences identifies the individual suspected of 
having the disease as having the disease. "Disease or disorder involving aberrant 
immune regulation" and "disease or disorder involving aberrant cell growth" are as 
defined above. 

10 By "difference" in the amino acid sequence of one of the four plexin polypeptides 

from an individual suspected of having the disease or disorder as compared with the 
amino acid sequence of one of the four plexin polypeptides from an unaffected 
individual, is meant any mutation that changes the amino acid sequence including 
substitution, deletion, of addition of one or more amino acid residues. 

15 Thus, in one nonlimiting example, one of the four plexin polypeptides is 

extracted from cells of an individual suspected of having a disease involving aberrant 
immune regulation (e.g., using an antibody according to the seventh aspect of the 
invention). The amino acid sequence of the plexin polypeptide is determined by N- 
terminal sequencing and compared to the amino acid sequence of one of the four plexin 

20 polypeptides from an unaffected individual (i.e., a normal individual of the same species 
that does not have a disease involving aberrant immune regulation or a disease involving 
aberrant cell growth). If there is a difference in the two amino acid sequences, the 
individual suspected of having a disease involving aberrant immune regulation is 
identified as having a disease involving aberrant immune regulation, and may be treated 

25 accordingly. 

In certain embodiments of the tenth aspect, the amino acid sequence of the plexin 
polypeptide from the unaffected individual comprises the sequence of SEQ ID NO: 2, 4, 
6, 8 or 10. 

The following examples are intended to further illustrate certain preferred 
30 embodiments of the invention and are not limiting in nature. Those skilled in the art 
will recognize, or be able to ascertain, using no more than routine experimentation, 
numerous equivalents to the specific substances and procedures described herein. Such 
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equivalents are considered to be within the scope of this invention, and are covered by 
the following claims. 

Practice of the present invention will employ, unless indicated otherwise, 
conventional techniques of cell biology, cell culture, molecular biology, microbiology, 

5 recombinant DNA, protein chemistry, and immunology, which are within the skill of 
the art. Such techniques are described in the literature. See, for example, Molecular 
Cloning: A Laboratory Manual, 2nd edition. (Sambrook, Fritsch and Maniatis, eds.), 
Cold Spring Harbor Laboratory Press, 1989; DNA Cloning, Volumes I and II (D.N. 
Glover, ed), 1985; Oligonucleotide Synthesis, (M.J. Gait, ed.), 1984; U.S. Patent No. 

10 4,683,195 (Mullis et al.,); Nucleic Acid Hybridization (B.D. Hames and SJ. Higgins, 
eds.), 1984; Transcription and Translation (B.D. Hames and S.J. Higgins, eds.), 1984; 
Culture of Animal Cells (R.I. Freshney, ed). Alan R. Liss, Inc., 1987; Immobilized 
Cells and Enzymes, IRL Press, 1986; A Practical Guide to Molecular Cloning (B. 
Perbal), 1984; Methods in Enzymology, Volumes 154 and 155 (Wu et al., eds), 

15 Academic Press, New York; Gene Transfer Vectors for Mammalian Cells (J.H. 

Miller and M.P. Calos, eds.), 1987, Cold Spring Harbor Laboratory; Immunochemical 
Methods in Cell and Molecular Biology (Mayer and Walker, eds.), Academic Press, 
London, 1987; Handbook of Experiment Immunology, Volumes I-IV (D.M. Weir and 
C.C. Blackwell, eds.), 1986; Manipulating the Mouse Embryo, Cold Spring Harbor 

20 Laboratory Press, 1986. 

The following Examples are provided to illustrate the present invention, and 
should not be construed as limiting thereof. 
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EXAMPLES 

Example 1 

Identification and cDNA cloning of plexitis and sequence analysis 

Since the coding sequences of human plexin-Bl (SEP), plexin-A2(OCT) and 

5 plexin-Al(NOV) were incomplete, we obtained the missing cDNA by RT-PCR; primers 
were designed by homology to orthologous murine sequences and corresponding ESTs. 
Updated database entries are X87904, X87831 and X87832, respectively. Partial cDNA 
of plexin-A4 was obtained by assembling five overlapping human ESTs (HGI THC 
Report: THC203425), deriving from chromosome 7 specific cDNA pools. Another EST 

10 from chr. 7 (clone 7B19F10) encodes the cytoplasmic domain of a plexin and 

presumably derives from the same gene as plexin-A4. Plexin-B2 cDNA was amplified 
by RT-PCR starting from the partial cDNA sequences of clones MM J (Shinoura, N., 
Shamraj, O.I., Hugenholz, H., Zhu, J.G., McBlack, P., Wamick, R., Tew, J.J., Wani, 
M.A., and Menon, A.G. (1995). Identification and partial sequence of a cDNA that is 

15 differentially expressed in human brain tumors. Cancer Lett 89, 215-221) and 

KIAA0315 (Genbank database); the genomic locus of SEP-B was identified due to its 
100% sequence identity with clone C22_31 1 from human chromosome 22. Plexin-B3 
coding sequence was identified in the genomic sequence of ALD locus on human 
chromosome Xq28, using the algorithms HEXON and GENIE. Plexin-Dl was similarly 

20 found in the genomic sequence of chromosome 3 (pac pDJ70i 11). The genomic 

sequence of plexin-Bl(SEP), in the region of the alternative splicing of the extracellular 

domain, was obtained using the following primers: sense 

5 ' GC AGC ACCTGTGC ACCC AC A AGGC3 ' and antisense: 

5 'TGCAGGCTGGACGGGAGGATGAGG3 ' . The common donor site is 

25 CCATCAG/gtgattgt (position 2028 from ATG); the alternative splice acceptor sites are: 
(i) cccccttcag/AGCCC, leading to the canonical plexin-Bl sequence, and (ii) 
ctcctctcag/GTGAT, leading to "plexin-Bl truncated" variant. All these new sequences 
were analyzed using the algoritms BLAST2, NETPHOS (phosphorylation prediction 
sites, by Nicolaj Blom), PH-PREDICT and Consensus Protein Secondary Structure 

30 prediction at IBCP. The phylogenetic tree was generated using A11A11 algorithm of the 
Darwin sequence analysis system (at CBRG). 

Example 2 
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Plexin cDNA expression constructs and protein analysis 

Cell transfections were carried out by Calcium phosphate and DEAE-dextran 
methods, using 5-10 ug of each cDNA (1-2 ug each in case of cotransfections). For 
transient transfections in COS and BOSC-23 cells the cDNA was cloned in pCDNA3 or 
5 derived expression plasmids (Invitrogen). MDCK stable transfectants for plexin-A3 
were obtained using pCEP4 expression plasmid (Invitrogen); the selection was done in 
the presence of Hygromicin-B (100-200 ug/ml). Plexin-A3 positive clones of MDCK 
cells were isolated from two independent transfections, and showed identical biological 
properties. Plexin and neuropilin expression constructs included a VSV- and myc-tag at 

10 the N' and C protein termini, respectively, detected by monoclonal antibodies anti- 
VSV-G (cat. V-5507, Sigma) and anti-cMyc-tag (cat. OP10-100UG, Calbiochem). 
"Plexin-Bl truncated" splice variant was expressed from a cDNA fragment isolated by 
RT-PCR and VSV-tagged at the N' terminus: the encoded amino acid sequence spans 
up to aa 676 (including the sema domain and two MRS motifs). "Plexin-Bl -sema" 

15 derives from a further deletion of the plexin-Bl extracellular domain, and exclusively 
includes the sema domain. "Plexin-Bl -Asema" protein mutant includes only the C 
terminal half of plexin-Bl extracellular domain, starting from amino acid 606, i.e. 
excluding sema domain and first MRS but including second and third MRS, 
transmembrane and intracellular domains. 

20 For immunoprecipitations, cells were lysed with EB buffer (20 mM Tris-HCl 

pH 7.4, 5 mM EDTA, 150 mM NaCl, 10% glycerol, 1% Triton X-100), in the presence 
of a cocktail of protease inhibitors and ImM Na-ortovanadate. Immunoprecipitations 
were performed at 4DC for 4h with the appropriate antibodies; high stringency washes 
were performed, in the presence of 1 M LiCl. 

25 For in vitro kinase assays, immunopurified proteins were incubated with kinase 

buffer (50 mM Hepes, 100 uM DTT, 5 mM MnCl 2 , 5 mM MgCl 2 ) in the presence of 
redivue 5 uCi [y- 3Z P] ATP (Amersham) for 10 minutes at 4°C in agitation. Samples 
were then submitted to SDS-PAGE and autoradiography, or analysed using a Phosphor- 
Imager system (Molecular Dynamics). Alkali treatment of the polyacrilamide gels was 

30 performed with 1M KOH for two hours at 55°C. 

Western blots were performed according to standard methods. Specific 
detection of phospho-tyrosines was done with PY20 MoAb (Trasduction laboratories). 
Final detection was done with ECL system (Amersham). 
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Example 3 

Semaphorin-SEAP binding assays 

Soluble forms of Semaphorin extracellular domains were expressed as chimeric 
molecules with placental Secreted Alkaline Phosphatase (SEAP) and harvested from 
5 the conditioned media of transiently transfected COS or BOSC-23 cells. Serum-free 
media were concentrated over 100 times using Centricon Plus-20 filters (Millipore) 
with a molecular weight cutoff of 100 kDa. The AP activity of these media was 
assessed as described (Flanagan, J.G. and Leder, P. (1990). The kit ligand: a cell surface 
molecule altered in steel mutant fibroblasts. Cell 63, 185-194); the specific activity of 

10 chimeric molecules was approx. 1000 U/mg. Concentrated Semaphorin-SEAP were 
diluted as appropriate in a Hepes buffered saline, additioned with 0.2% BSA, 0.1% 
NaN 3 , 5 mM CaCl 2 and 1 mM MgCl 2 (HBSBA). For binding assays, COS cells 
transiently transfected with plexins were seeded on 48 well plates to reach confluence, 
and then incubated with Semaphorin-SEAP preparations (approx 1-5 nM) for 90 

15 minutes at room temperature. The binding was detected as described (Flanagan and 

Leder, 1990). Binding experiments with plexin-Cl/VESPR were as described (Comeau, 
M.R., Johnson, R., DuBose, R.F., Petersen, M., Gearing, P., VandenBos, T., Park, L., 
Farrah, T., Buller, R.M., Cohen, J.I., Strockbine, L.D., Rauch, C, and Spriggs, M.K. 
(1998). A poxvirus-encoded semaphorin induces cytokine production from monocytes 

20 and binds to a novel cellular semaphorin receptor, VESPR. Immunity. 8, 473-482; He, 
Z. and Tessier-Lavigne, M. (1997). Neuropilin is a receptor for the axonal 
chemorepellent Semaphorin III. Cell 90, 739-751). 

For in vitro binding assays, plexin-B 1 was purified from cell extracts by 
immunoprecipitation with anti-VSV antibody. Extracts of mock-transfected cells were 

25 used as control samples. After washing, the immunocomplexes were incubated with 
serial dilutions of CD100-SEAP (prepared as above) for 2 hours at 4°C, in continuous 
agitation. Samples were then washed 3 times with HBSBA and the bound alkaline 
phosphatase activity was measured by colorimetric assay using p-nitro-phenyl- 
phosphate, as described (Flanagan and Leder, 1990). Scatchard analysis was done using 

30 Equilibrate (by GertJan C. Veenstra). 
Example 4 

In situ hybridization analysis 
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RNA in situ hybridization was performed essentially as described (He, Z. and 
Tessier-Lavigne, M. (1997). Neuropilin is a receptor for the axonal chemorepellent 
Semaphorin m. Cell 90, 739-751). Briefly cDNA fragments of plexin-Al, -A2, and - 
A3 were used to generate 35 S-labeled antisense and sense RNA probes, which were 
5 used for in situ hybridization histochemistry of cryostat sections of rat embryos. 
Example 5 

Xenopus turning assay 

The methods for injecting mRNA encoding various constructs, and for studying 

the turning responses of the neurons, are exactly as described previously (Hong, K., 
10 Hinck, L., Nishiyama, M., Poo, M.M., Tessier-Lavigne, M., and Stein, E. (1999). A 

ligand-gated association between cytoplasmic domains of UNC5 and DCC family 

receptors converts netrin-induced growth cone attraction to repulsion. Cell 97, 927- 

941); Song, H., Ming, G., He, Z., Lehmann, M., Tessier-Lavigne, M., and Poo, M. 

(1998). Conversion of neuronal growth cone responses from repulsion to attraction by 
15 cyclic nucleotides [see comments]. Science 281, 1515-1518). 

Example 6 

Mixed-culture assays and time-lapse videomicroscopy 

Mock-transfected and plexin-A3 overexpressing MDCK cells were seeded with 
mesenchymal cells (NIH 3T3, KJ29, D17, among others), in multiwell culture plates by 

20 1:4 or 1:1 ratio. NIH and KJ-29 cells were sometimes labeled by addition of Dil (1,1'- 
dioctadecyl-3,3,3',3'-tetramethylindocarbocyanine perchlorate, Fluka) in the culture 
medium, 4 hours before harvesting for the assay; clusters of cells marked with this dye 
are marked in blue (in light microscopy) and emit red epifluorecence (TRITC filter). 
The repelling effect was observed 16-30 hours after confluency, by contrast phase 

25 microscopy using Leica DM IL. The progress of the assays was also monitored by time- 
lapse video-microscopy (320 minutes recording were converted into 1 minute play). To 
determine the time-length of cell contacts, for each assay, randomly chosen fibroblasts 
were followed during several hours and the duration of each contact between their 
lamellipodia and MDCK cells was measured. The doubling time of cells and their 

30 viability during the assay could also be analyzed, and no differences were observed in 
presence of control or plexin-A3 expressing cells. Substrate adhesion of plexin-A3 
overexpressing MDCKs was analyzed by counting attached cells after 30 minutes from 
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seeding on micro-wells coated with fibronectin, collagen or polylysin, in the absence of 
calf serum: no differences versus control cells were observed. 
Example 7 
Apoptosis detection 

5 TUNEL reaction (Boehringer detection kit) was performed on mixed coltures of 

MDCK and NIH3T3 cells, 24 hours after seeding in a 24- well culture plate. The 
labeling was converted into a colorimetric signal for analysis by light microscopy using 
the TUNEL- AP detection kit (Boehringer). As a positive control for the induction of 
apoptosis, the same cells were treated with UV-C (50 uJ/cm 2 ) or luM staurosporin. 

10 Although the foregoing invention has been described in some detail by way of 

illustration and example for purposes of clarity of understanding, it will be apparent to 
those skilled in the art that certain changes and modifications will be practiced. 
Therefore, the description and examples should not be construed as limiting the scope 
of the invention, which is delineated by the appended claims. 

15 Example 8 

Plexins are specific receptors for cell surface semaphorins in vertebrates 

Plexin-Cl (VESPR) has been shown to bind the soluble viral semaphorins Sema 
VA and VB (Comeau et al, 1998 supra), and we recently found that Drosophila Plexin 
A (D Plex A) interacts with transmembrane Sema la (Winberg et al., 1998 supra). We 

20 therefore examined in vertebrates whether the extracellular domain of several different 
cellular semaphorins -fused to alkaline phosphatase- could bind members of the human 
plexin-A, -B and -C subfamilies. Multiple secreted semaphorins of class 3 (Sema3A, 
Sema 3C or Sema3F; see below) did not interact with plexins-Al, -A2, -A3, -Bl, B2, or 
-CI (data not shown). In contrast, plexin-Cl(Vespr) specifically bound Sema7A(Sema- 

25 Kl) (Fig. 2a), a GPI-membrane linked semaphorin (class 7). This result is not entirely 
unexpected, since Sema7A may represent the cellular counterpart of viral semaphorin 
SemaVB, previously shown to interact with this plexin (Comeau et al., 1998 supra). 
More interestingly, the class 4 transmembrane semaphorin Sema4D (CD 100) did 
interact strongly and specifically with plexin-Bl (Fig. 2a). Thus the prototypes of two 

30 distinct plexin families are the receptors for members of two distinct semaphorin sub- 
classes. We also found that Sema7A and Sema4D do not bind to neuropilin-1 or -2 
alone, nor did co-transfection of either neuropilin with plexin-B 1 significantly modify 
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its binding efficiency (not shown). Neuropilins thus seem so far to function as receptors 
only for vertebrate semaphorins of class 3. 

The affinity constant of Sema4D for plexin-B 1 was estimated by Scatchard plot 
to be in the subnanomolar range (K D = 0.9 nM, Fig. 2b; the estimated K D of Sema7A 
5 for plexin-Cl is 2.1 nM, not shown). These values are consistent with those observed 
for semaphorins-neuropilins, and fly semaphorinl-Plexin A interactions (He and 
Tessier-Lavigne, 1997 supraWinberg et al., 1998). 

We used two deletion constructs of plexin-B 1 to explore the semaphorin 
binding sites of plexins. Neither the N-terminal half of plexin-B 1 extracellular domain 
10 ("plexin-B 1 truncated", see previous paragraph), nor its C-tcrminal half ("plexin-B 1- 
Asema", see Experimental Procedures) was sufficient alone to bind CD 100 (see Fig. 
2a), suggesting that the binding of Sema4D depends on multiple structural determinants 
of the extracellular domain of plexin-B 1. 
Example 9 

15 Plexins associate with class 3 Semaphorin receptors, Neuropilins 

As outlined above, secreted semaphorins of subclass 3 are known to bind 
neuropilins (He and Tessier-Lavigne, 1997 supra; Kolodkin et al., 1997 supra; Chen, 
H., Chedotal, A., He, Z., Goodman, C.S., and Tessier-Lavigne, M. (1997) "Neuropilin- 
2, a novel member of the neuropilin family, is a high affinity receptor for the 

20 semaphorins Sema E and Sema IV but not Sema D3." Neuron 19, 547-559). However, 
the short cytoplasmic tail of neuropilins seems to be dispensable for their biological 
activity (Nakamura, et al (1998) supra), indicating the requirement of an associated co- 
receptor for signal transduction. Interestingly, in Drosophila (where neuropilins have 
not been identified to date) Plexin A is sufficient to mediate the biological response to 

25 semaphorin-1 in axon guidance (Winberg et al., \99%supra). 

In an initial set of experiments, we could not observe binding of the class 3 
semaphorins Sema3A(Sema HJ), Sema3C(Sema E) or Sema3F(Sema IV) to plexins-Al, 
A2, A3, BL B2 or CI (not shown). To test whether plexins might be coreceptors with 
neuropilins for class 3 semaphorins, we set up co-precipitation experiments in COS 

30 cells to test whether neuropilins may interact with plexins. Three tested plexins (plexin- 
Al, -A3 and -Bl) associated both with neuropilin-2 (Np2, shown in Fig. 3) and 
neuropilin- 1 (not shown). The binding was specific, inasmuch as neither neuropilin nor 
any plexins coimmunoprecipitated with the netrin receptor DCC, under conditions 
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where DCC coimmunoprecipitated with the other netrin receptor UNC5H2 (Fig. 3 and 
data not shown). We observed finally that the plexin-neuropilin association is mediated 
by the sema domain of plexins, as demonstrated using either the "plexin-Bl truncated" 
splice variant (Figure 3) or an even shorter form of the extracellular domain ("plexin- 

5 Bl-sema", see Experimental procedures, not shown). 

To further support the idea of a plexin-neuropilin multimeric receptor complex 
for semaphorins, we show here that plexin-A3 (e.g.) is expressed in a large number of 
neuronal classes, including sensory, sympathetic, motor, and olfactory bulb neurons 
(Figure 4 and data not shown), which are known to respond to class 3 semaphorins, and 

10 which express either neuropilin-1 or neuropilin-2 or both (Chen et al., 1997 supra; 
Feiner, L., Koppel, A.M., Kobayashi, H., and Raper, J.A. (1997). Secreted chick 
semaphorins bind recombinant neuropilin with similar affinities but bind different 
subsets of neurons in situ. Neuron 19, 539-545; He and Tessier-Lavigne, 1997 supra; 
Kolodkin et al, 1997 supra). Thus, plexin-A3 is a candidate for a physiological 

15 coreceptor involved in mediating class 3 semaphorin effects on these axons. Other 

plexins may also have a role as neuropilin coreceptors in specific cell populations, such 
as plexin-A2, which is expressed in a subset of sensory neurons and in dorsal horn 
cells, and plexin-Al, which is expressed at low levels and broadly in the spinal cord 
(Figure 4). 

20 To directly test the possible involvement of plexins in class 3 semaphorin signal 

transduction, we studied the repulsive responses of Xenopus spinal neurons to Sema3A, 
which is mediated by a receptor mechanism involving neuropilin-1 (Song et al., 1998 
supra). We asked whether these responses could be altered by expression of a 
presumed dominant-negative plexin-Al construct lacking the cytoplasmic domain of 

25 the protein. Transmembrane proteins can be reliably expressed in these neurons by 
injecting the encoding mRNA at the developmental two cell stage, allowing the 
embryos to grow to tadpole stage, and then removing the spinal cord and culturing the 
neurons (Hong et al., 1999 supra). We therefore injected the mRNA encoding the 
truncated plexin-Al construct, together with mRNA encoding GFP (as a reporter) and 

30 then studied the responses of spinal neurons expressing GFP that were derived from 
these embryos. Whereas control spinal neurons are repelled by Sema3A (Figure 5A, B 
and Song et al. 1998 supra), neurons from embryos injected with mRNA for truncated 
plexin-Al did not respond with either repulsion or attraction to Sema3A (Figure 5C, 
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D). This blocking effect appeared to be specific, since expression of a different 
heterologous receptor, UNC5H2, did not impair repulsion by Sema3A (Hong et al., 
1997 supra), and since expression of the truncated plexin construct did not block 
attractive responses to netrin-1 (Figure 5E, F). Figure 5G, H quantifies these effects. 

5 As can be seen, the effect of Scma3A is completely abolished by the truncated plexin; 
although there is a slight apparent decrease in the attractive effect of netrin-1 the effect 
is not statistically significant. 

Although we have used a truncated plexin-Al construct, this construct may be 
expected to interfere with the function of various plexins, since all the plexins tested 

10 (Al, A3 and Bl) associated with neuropilin-1. These results support a role for one or 
more plexins in mediating the repulsive Sema3A signal in the Xenopus spinal neurons. 
Example 10 

Plexins signal via a novel type of tyrosine phosphorylated cytoplasmic domain 

The sequences of plexin cytoplasmic domains are highly conserved among 

15 plexins but do not match any known sequences. We found that the plexin-A3 and 
plexin-Bl proteins are phosphorylated on tyrosine residues when overexpressed in 
human kidney cells (BOSC-23), as demonstrated using anti-phosphotyrosine antibodies 
(Fig. 6a). Furthermore, after immunoprecipitation and in vitro kinase assays, plexin-A3 
and plexin-Bl became phosphorylated (Fig. 6b). Resistance to an alkali treatment (see 

20 Experimental procedures) confirmed the specific phosphorylation of tyrosine residues. 

The cytoplasmic domains of several receptors, including Met proteins, become 
tyrosine phosphorylated owing to an intrinsic kinase activity (Ullrich, A. and 
Schlessinger, J. (1990) "Signal transduction by receptors with tyrosine kinase activity." 
Cell 61, 203-212). Since the cytoplasmic domain of plexins is not similar to any bona 

25 fide or atypical tyrosine kinase, this suggests that a distinct tyrosine kinase co- 

immunoprecipitates in association with plexins, and is responsible for their tyrosine 
phosphorylation. Although some additional phosphorylated proteins can be found 
specifically with plexin-A3 and -Bl, we have not as yet identified this associated 
kinase. A number of endogenously expressed tyrosine kinases, namely Met, Ron, Abl 

30 and Src, were not found associated with plexin-A3 by immunoprecipitation and 

Western blotting (not shown). Since tyrosine phosphorylated residues often function as 
docking sites for intracellular signal transducers (Cantley, L.C., Auger, K.R., Carpenter, 
C, Duckworth, B., Graziani, A., Kapeller, R., and Soltoff, S. (1991) "Oncogenes and 
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signal transduction." Cell 64, 281-302), the fact that the cytoplasmic domains of 
plexins are tyrosine phosphorylated further suggests that they are part of signaling 
complexes. 
Example 11 

5 Plexin-A3 expressing cells induce repulsion of co-cultured cells 

Stable transfectants expressing recombinant human plexin-A3 were successfully 
obtained in four different cell lines: IMR32 and AF8 (human neuroblasts), and BOSC- 
23 and MDCK (human and canine kidney cells, respectively). We observed modest 
phenotypic changes in the transfected cells, which generally become flatter and larger in 

10 size. The growth rate of plexin-A3 overexpressing cells was comparable to parental 
lines and we did not observe differences in the ability to adhere on different substrates 
(data not shown). 

In keeping with previous report on the related Plexin of Xenopus laevis (Ohta, 
K., Mizutani, A., Kawakami, A., Murakami, Y., Kasuya, Y., Takagi, S., Tanaka, H., 

15 and Fujisawa, H. (1995). "Plexin: a novel neuronal cell surface molecule that mediates 
cell adhesion via a homophilic binding mechanism in the presence of calcium ions." 
Neuron 14, 1189-1199), we observed a modest increase in calcium-dependent 
homotypic cell aggregation of plexin-A3 transfectants (not shown). Surprisingly, we 
found that epithelial MDCK cells overexpressing plexin-A3 mediate strong repelling 

20 cues for adjacent cells. This was observed by co-culturing mock-transfected and plexin- 
A3 overexpressing MDCK cells together with several non-epithelial cell lines (such as 
NIH3T3, Kj29, and D17; Fig. 7A). Mock MDCKs grew alongside mesenchymal cells 
until confluency, when both cell types stopped proliferating. In contrast, when plexin- 
A3-overexpressing epithelial cells were grown in the same conditions, the adjacent 

25 mesenchymal cells withdrew from them, and ultimately detached from the plate. 

To analyze the dynamics of this repulsion process, we monitored for 36 hours, by 
time-lapse video-microscopy, mixed cultures of transfected MDCK cells and 
fibroblasts, in a number of independent experiments. At low cell density, fibroblasts 
showed intrinsic motility, exploring the surface of the plate with long lamellipodia and 

30 filopodia, and thus coming in contacts with a high number of stationary MDCK islets. 
The time-length of the contacts between fibroblasts and control MDCK cells varied 
from 30 minutes to several hours, lasting mostly over 100 minutes. However, when 
fibroblasts were cultured with MDCK cells overexpressing plexin-A3, transient 
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contacts were observed, often lasting less than 30 minutes (see Fig. 7C). At higher cell 
density, fibroblasts stopped and clustered alongside the islands of control MDCKs, 
whereas they kept moving in a hectic fashion between the islands of plexin-A3 
transfected cells (data not shown). 

5 This cell-repelling effect is not due to the release of soluble factors, since 

exchanging conditioned media between mixed cultures was without effect (not shown). 
Moreover, the two different cell populations grew normally until they came into 
contact, indicating that the repelling effect requires cell-cell interaction. To rule out the 
possibility that plexin-A3 expressing cells generate an apoptotic signal for fibroblasts, 

10 we monitored cell viability and apoptosis by TUNEL staining. As shown in Figure 7B, 
the clusters of repelled fibroblasts did not include apoptotic cells; furthermore, the 
detaching cells still excluded Trypan blue stain and were able to spread again on a new 
culture plate (not shown). 

Taken together, these results demonstrate that in our experimental system, plexin- 

15 A3 mediates cell repelling cues, presumably by interacting with surface bound ligands 
on opposing cells. We could not identify -so far- the specific ligand for plexin-A3, 
however we propose that this may be a transmembrane semaphorin. It should be noted 
that the intracellular domains of transmembrane semaphorins, such as Sema4D, also 
include tyrosine residues, which may themselves become phosphorylated and associate 

20 with cytoplasmic signal transducer molecules, a property shown for ligands of the 
ephrin family (Holland, S.J., Gale, N.W., Mbamalu, G., Yancopoulos, G.D., 
Henkemeyer, M, and Pawson, T. (1996) "Bidirectional signalling through the EPH- 
family receptor Nuk and its transmembrane ligands." Nature 383, 722-725). 
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What is claimed is: 



1. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a 
nucleotide sequence that encodes an amino acid sequence selected from the 
group consisting of the amino acid sequence shown in (SEQ ID NO: 2 
(plexin B-2)), (SEQ ID NO: 4 (plexin B-3)), (SEQ ID NO: 6 (plexin D-l)) 
and (SEQ ID NO: 8 (plexin A-4)) 

2. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a 
nucleotide sequence selected from the group consisting of the nucleotide 
sequence shown (SEQ ID NO: 1 (plexin B-2)), (SEQ ID NO: 3 (plexin B- 
3)), (SEQ ID NO: 5 (plexin D-l)) and (SEQ ID NO: 7 (plexin A-4)). 

3. A vector comprising the nucleic acid of any one of claims 1 or 2. 

4. An isolated polypeptide having at least 80% amino acid sequence identity to 
an amino acid sequence selected from the group consisting of the amino acid 
sequence shown in (SEQ ID NO: 2 (plexin B-2)), (SEQ ID NO: 4 (plexin B- 
3)), (SEQ ID NO: 6 (plexin D-l)) and (SEQ ID NO: 8 (plexin A-4)). 

5. An isolated polypeptide having at least 80% amino acid sequence identity 
to: 

(a) the polypeptide shown in (SEQ ID NO: 2 (plexin B-2)), (SEQ ID 
NO: 4 (plexin B-3)), (SEQ ID NO: 6 (plexin D-l)) and (SEQ ID NO: 8 (plexin 
A-4)), lacking its associated signal peptide; 

(b) an extracellular domain of the polypeptide shown in (SEQ ID 
NO: 2 (plexin B-2)), (SEQ ID NO: 4 (plexin B-3)), (SEQ ID NO: 6 (plexin D- 
1)) and (SEQ ID NO: 8 (plexin A-4)), with its associated signal peptide; or 

(c) an extracellular domain of the polypeptide shown in (SEQ ID 
NO: 2 (plexin B-2)), (SEQ ID NO: 4 (plexin B-3)), (SEQ ID NO: 6 (plexin D- 
1)) and (SEQ ID NO: 8 (plexin A-4)), lacking its associated signal peptide. 
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6. A chimeric molecule comprising a polypeptide according to claim 4 or 5 
fused to a heterologous amino acid sequence. 

7. The chimeric molecule of claim 6, wherein the heterologous amino acid 
sequence is a Fc region of an immunoglobulin. 

8. An antibody that specifically binds to a polypeptide according to claim 4 or 
5. 

9. The antibody according to claim 8, wherein the antibody is a monoclonal, a 
humanized antibody or a single-chain antibody. 

10. A method of suppressing or altering aberrant cell growth involving a 
signaling pathway between a plexin and a neuropilin in a mammal 
comprising the step of administering an effective amount of an agent to said 
mammal capable of interfering with the association between the plexin and 
neuropilin 

1 1. A method of treating, suppressing or altering a disorder involving aberrant 
immune regulation involving a signaling pathway between a plexin and a 
neuropilin in a mammal comprising the step of administering an effective 
amount of an agent to said mammal capable of interfering with the 
association between the plexin and neuropilin. 

12. The method according to claim 10 or 1 1 wherein said agent is a chimeric 
molecule according to claim 6 or 7. 

13. The method according to claim 10 or 11, wherein said agent is an antibody 
according to claim 8 or 9. 

14. A method of diagnosing or screening for tumors in a subject characterized 
by the expression profiles of the polypeptides according to claim 4 or 5 
wherein the expression profile of the polypeptides is different in a non- 
tumor sample as compared to the expression profile of the polypeptides in a 
tumor sample. 
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KJ-29 NIH-3T3 NIH-3T3 




control Plexin-A3 

Persistent contacts " Transient contacts * Very transient contacts 
(over 1 00 min) (less than 1 00 min) (less than 30 min) 



WO 01/14420 



PCT/USOO/23365 



SEQUENCE LISTING 

<110> University of Torino 

<12 0> Novel Plexins and Uses Thereof 



<130> A077PCT 

<140> Not assigned yet 
<141> 2000-08-25 

<150> 60/150576 
<151> 1999-08-25 

<160> 12 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 6252 

<212> DNA 

<213> HOMO SAPIEN 

<400> 1 

gcggggggca atggcactgc agctctgggc cctgaccctg ctgggcctgc tgggcgcagg 60 

tgccagcctg aggccccgca agctggactt cttccgcagc gagaaagagc tgaaccacct 12 0 

ggctgtggat gaggcctcag gcgtggtgta cctgggggcg gtgaatgccc tctaccagct 180 

ggatgcgaag ctgcagctgg agcagcaggt ggccacgggc ccggccctgg acaacaagaa 240 

gtgcacgccg cccatcgagg ccagccagtg ccatgaggct gagatgactg acaatgtcaa 300 

ccagctgctg ctgctcgacc ctcccaggaa gcgcctggtg gagtgcggca gcctcttcaa 360 

gggcatctgc gctctgcgcg ccctgagcaa catctccctc cgcctgttct acgaggacgg 42 0 

cagcggggag aagtctttcg tggccagcaa tgatgagggc gtggccacag tggggctggt 48 0 

gagctccacg ggtcctggtg gtgaccgcgt gctgtttgtg ggcaaaggca atgggccaca 540 

cgacaacggc atcatcgtga gcactcggct gttggaccgg actgacagca gggaggcctt 600 

tgaagcctac acggaccacg ccacctacaa ggccggctac ctgtccacca acacacagca 660 

gttcgtggcg gccttcgagg acggccccta cgtcttcttt gtcttcaacc agcaggacaa 720 

gcacccggcc cggaaccgca cgctgctggc acgcatgtgc agagaagacc ccaactacta 780 

ctcctacctg gagatggacc tgcagtgccg ggaccccgac atccacgccg ctgcctttgg 840 

cacctgcctg gccgcctccg tggctgcgcc tggctctggc agggtgctat atgctgtctt 900 

cagcagagac agccggagca gtggggggcc cggtgcgggc ctctgcctgt tcccgctgga 960 

caaggtgcac gccaagatgg aggccaaccg caacgcctgt tacacaggca cccgggaggc 102 0 

ccgtgacatc ttctacaagc ccttccacgg cgatatccag tgcggcggcc acgcgccggg 1080 

ctccagcaag agcttcccat gtggctcgga gcacctgccc tacccgctgg gcagccgcga 1140 

cgggctcaga ggcacagccg tgctgcagcg tggaggcctg aacctcacgg ccgtgacggt 1200 

cgccgccgag aacaaccaca ctgttgcttt tctgggcacc tctgatggcc ggatcctcaa 1260 

ggtgtacctc accccagatg gcacctcctc agagtacgac tctatccttg tggagataaa 1320 

caagagagtc aagcgcgacc tggtactgtc tggagacctg ggcagcctgt acgccatgac 1380 

ccaggacaag gtgttccggc tgccggtgca ggagtgcctg agctacccga cctgcaccca 1440 

gtgccgcgac tcccaggacc cctactgcgg ctggtgcgtc gtcgagggac gatgcacccg 1500 

gaaggccgag tgtccgcggg ccgaggaggc cagccactgg ctgtggagcc gaagcaagtc 1560 

ctgcgtggcc gtcaccagcg cccagccaca gaacatgagc cggcgggccc agggggaggt 1620 

gcagctgacc gtcagccccc tccctgccct gagcgaggag gacgagttgc tgtgcctttt 1680 

tggggagtcg ccgccacacc ccgcccgcgt ggagggcgag gccgtcatct gcaactcccc 1740 
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aagcagcatc cccgtcacac cgccaggcca ggaccacgtg gccgtgacca tccagctcct 1800 

ccctagacga ggcaacatct tcctcacgtc ctaccagtac cccttctacg actgccgcca 1860 

ggccatgagc ctggaggaga acctgccgtg catctcctgc gtgagcaacc gctggacctg 1920 

ccagtgggac ctgcgctacc acgagtgccg ggaggcttcg cccaaccctg aggacggcat 1980 

cgtccgcgcc cacatggagg acagctgtcc ccagttcctg ggacccagcc ccctggtgat 2040 

ccccatgaac cacgagacag atgtgaactt ccagggcaag aacctggaca ccgtgaaggg 2100 

ttcctccctg cacgtgggca gtgacttgct caagttcatg gagccggtga ccatgcagga 2160 

anccgggacc ttcgcctttc ggaccccaaa gctgtcccac gatgccaacg agacgctgcc 2220 

cctzgcacctc tacgtcaagt cttacggcaa gaatatcgac agcaagcccc atgtgaccct 2280 

ctacaactgc tcctttggcc gcagcgactg cagcctgtgc cgggccgcta accccgacta 2340 

caggtgcgcg tggtgcgggg gccagagcag gtgcgtgtat gaggccctgt gcaacaccac 2400 

cnccgagtgc ccgccgcccg tcatcaccag gatccagcct gagacgggcc ccctgggtgg 2460 

gggcarccgc atcaccatcc tggggtccaa tttgggcgtc caagcagggg acatccagag 2520 

gacctctgtg gccggccgga actgctcctt tcagccggaa cgttaccccg tgtccacccg 2580 

gaccgrgtgt gtgatcgagg ctgcggagac gcctttcacg gggggtgtcg aggtggacgt 2640 

ctccgggaaa ctgggccgtt cgcctcccaa tgtccagttc accttccaac agcccaagcc 2700 

tcccag-gtg gagccgcagc agggaccgca ggcgggcggc accacactga ccatccacgg 2760 

cacccacctg gacacgggct cccaggagga cgtgcgggtg accctcaacg gcgtcccgtg 2820 

taaagrgacg aagtttgggg cgcagctcca gtgtgtcact ggcccccagg cgacacgggg 2880 

ccagatgctt ctggaggtct cctacggggg gtcccccgtg cccaaccccg gcatcttctt 2940 

cacctaccgc gaaaaccccg tactgcgagc cttcgagccg ctacgaagct ttgccagtgg 3000 

tggccgcagc atcaacgtca cgggtcaggg cttcagcctg atccagaggt ttgccatggt 3060 

ggccatcgcg gagcccctgc agtcctggca gccgccgcgg gaggctgaat ccctgcagcc 3120 

cacgacggtg gtgggtacag actacgtgtt ccacaatgac accaaggtcg tcttcctgtc 3180 

cccggc-gtg cctgaggagc cagaggccca caacctcacg gtgctgatcg agatggacgg 3240 

gcaccgrgcc ctgctcagaa cagaggccgg ggccttcgag tacgtgcctg accccacctt 3300 

tgagaacttc acaggtggcg tcaagaagca ggtcaacaag ctcatccacg cccggggcac 33 60 

caacc-gaac aaggcgatga cgccgcagga ggccgaggcc ttcgtgggtg ccgagcgctg 3420 

caccacgaag acgctgacgg agaccgacct gtactgtgag cccccggagg tgcagccccc 3480 

gcccaagcgg cggcagaaac gagacaccac acacaacctg cccgagttca ttgtgaagtt 3540 

cggctc-cgc gagtgggtgc tgggccgcgt ggagtacgac acacgggtga gcgacgtgcc 3 600 

gcrcagcctc atcttgccgc tggtcatcgt gcccatggtg gtcgtcatcg cggtgtctgt 3660 

ctactgctac tggaggaaga gccagcaggc cgaacgagag tatgagaaga tcaagtccca 3720 

gccggagggc ctggaggaga gcgtgcggga ccgctgcaag aaggaattca cagacctgat 3780 

gatcgagatg gaggaccaga ccaacgacgt gcacgaggcc ggcatccccg tgctggacta 3840 

caagacctac accgaccgcg tcttcttcct gccctccaag gacggcgaca aggacgtgat 3900 

gatcaccggc aagctggaca tccctgagcc gcggcggccg gtggtggagc aggccctcta 3960 

ccagttctcc aacctgctga acagcaagtc tttcctcatc aatttcatcc acaccctgga 4020 

gaaccagcgg gagttctcgg cccgcgccaa ggtctacttc gcgtccctgc tgacggtggc 4080 

gccgcacggg aaactggagt actacacgga catcatgcac acgctcttcc tggagctcct 4140 

ggagcagtac gtggtggcca agaaccccaa gctgatgctg cgcaggtctg agactgtggt 4200 

ggagaggatg ctgtccaact ggatgtccat ctgcctgtac cagtacctca aggacagtgc 4260 

cggggagccc ctgtacaagc tcttcaaggc catcaaacat caggtggaaa agggcccggt 432 0 

ggatgcggta cagaagaagg ccaagtacac tctcaacgac acggggctgc tgggggatga 4380 

tgrggagtac gcacccctga cggtgagcgt gatcgtgcag gacgagggag tggacgccat 4440 

cccggtgaag gtcctcaact gtgacaccat ctcccaggtc aaggagaaga tcattgacca 4500 

gg-gtaccgt gggcagccct gctcctgctg gcccaggcca gacagcgtgg tcctggagtg 4560 

gcgtccgggc tccacagcgc agatcctgtc ggacctggac ctgacgtcac agcgggaggg 4620 

ccggtggaag cgcgtcaaca cccttatgca ctacaatgtc cgggatggag ccaccctcat 4680 

cctgtccaag gtgggggtct cccagcagcc ggaggacagc cagcaggacc tgcctgggga 4740 

gcgccacgcc ctcctggagg aggagaaccg ggtgtggcac ctggtgcggc cgaccgacga 4800 

ggcggacgag ggcaagtcca agagaggcag cgtgaaagag aaggagcgga cgaaggccat 4860 

caccgagatc tacctgacgc ggctgctctc agtcaagggc acactgcagc agtttgtgga 492 0 

caacttcttc cagagcgtgc tggcgcctgg gcacgcggtg ccacctgcag tcaagtactt 4980 

cttcgacttc ctggacgagc aggcagagaa gcacaacatc caggatgaag acaccatcca 5 040 
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catctggaag acgaacagct taccgctccg gttctgggtg aacatcctca agaaccccca 5100 

cttcatcttt gacgtgcatg tccacgaggt ggtggacgcc tcgctgtcag tcatcgcgca 5160 

gaccttcatg gatgcctgca cgcgcacgga gcataagctg agccgcgatt ctcccagcaa 5220 

caagctgctg tacgccaagg agatctccac ctacaagaag atggtggagg attactacaa 5280 

ggggatccgg cagatggtgc aggtcagcga ccaggacatg aacacacacc tggcagagat 5340 

ttcccgggcg cacacggact ccttgaacac cctcgtggca ctccaccagc tctaccaata 5400 

cacgcagaag tactatgacg agatcatcaa tgccttggag gaggatcctg ccgcccagaa 5460 

gatgcagctg gccttccgcc tgcagcagat tgccgctgca ctggagaaca aggtcactga 5520 

cctctgacct acaatctcca gtgctgcctt gggacatagg tacctgaggt acctgagagc 5580 

ccctcagggg aggaggccga gtggctgtgg ctgaggcccc caccctcccc tggaacgcgc 5640 

cccaagccgg agtgggtgca gccggaaccc gcccagcgtc tagactgtag catcttcctc 5700 

tgagcaatac cgccgggcac cgcaccagca ccagccccag ccccagctcc ctccggccgc 5760 

agaaccagca tcgggtgttc actgtcgagt ctcgagtgat ttgaaaatgt gccttacgct 5820 

gccacgctgg gggcagctgg cctccgcctc cgcccacgca ccagcagccg cctccatgcc 5880 

ctaggttggg cccctggggg atctgagggc ctgtggcccc cagggcaagt tcccagatcc 5940 

tatgtctgtc tgtccaccac gagatgggag gaggagaaaa agcggtacga tgccttcctg 6000 

acctcaccgg cctccccaag ggtgccggca ctctgggtgg actcacggct gctgggcccc 6060 

acgtcaaagg tcaagtgaga cgtaggtcaa gtcctacgtc ggggcccaga catcctgggg 612 0 

tcctggtctg tcagacaggc tgccctagag ccccacccag tccgggggga ctgggagcag 6180 

ttccaagacc accccacccc tttttgtaaa tcttgttcat tgtaaatcaa atacagcgtc 6240 

tttttcactc eg 6252 
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Ser Leu He Gin 


Arg 


Phe Ala Met 


Val 


Val 


He 


Ala 


Glu 


Pro 


Leu 


Gin 
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Ser Trp Gin Pro 
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Val 
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1060 1065 1070 

He Glu Met Asp Gly His Arg Ala Leu Leu Arg Thr Glu Ala Gly Ala 
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1075 1080 1085 

Phe Glu Tyr Val Pro Asp Pro Thr Phe Glu Asn Phe Thr Gly Gly Val 

1090 1095 1100 

Lys Lys Gin Val Asn Lys Leu lie His Ala Arg Gly Thr Asn Leu Asn 
1105 1110 1115 1120 

Lys Ala Met Thr Leu Gin Glu Ala Glu Ala Phe Val Gly Ala Glu Arg 

1125 1130 1135 

Cys Thr Met Lys Thr Leu Thr Glu Thr Asp Leu Tyr Cys Glu Pro Pro 

1140 1145 1150 

Glu Val Gin Pro Pro Pro Lys Arg Arg Gin Lys Arg Asp Thr Thr His 

1155 1160 1165 

Asn Leu Pro Glu Phe lie Val Lys Phe Gly Ser Arg Glu Trp Val Leu 

1170 1175 1180 

Gly Arg Val Glu Tyr Asd Thr Arg Val Ser Asp Val Pro Leu Ser Leu 
1185 1190 1195 1200 

He Leu Pro Leu Val He Val Pro Met Val Val Val He Ala Val Ser 

1205 1210 1215 

Val Tyr Cys Tyr Trp Arg Lys Ser Gin Gin Ala Glu Arg Glu Tyr Glu 

1220 1225 1230 

Lys He Lys Ser Gin Leu Glu Gly Leu Glu Glu Ser Val Arg Asp Arg 

1235 1240 1245 

Cys Lys Lys Glu Phe Thr Asp Leu Met He Glu Met Glu Asp Gin Thr 

1250 1255 1260 

Asn Asp Val His Glu Ala Gly He Pro Val Leu Asp Tyr Lys Thr Tyr 
1265 1270 1275 1280 

Thr Asp Arg Val Phe Phe Leu Pro Ser Lys Asp Gly Asp Lys Asp Val 

1285 1290 1295 

Met He Thr Gly Lys Leu Asp lie Pro Glu Pro Arg Arg Pro Val Val 

1300 1305 1310 

Glu Gin Ala Leu Tyr Gin Phe Ser Asn Leu Leu Asn Ser Lys Ser Phe 

1315 1320 1325 

Leu He Asn Phe He His Thr Leu Glu Asn Gin Arg Glu Phe Ser Ala 

1330 1335 1340 

Arg Ala Lys Val Tyr Phe Ala Ser Leu Leu Thr Val Ala Leu His Gly 
1345 1350 1355 1360 

Lys Leu Glu Tyr Tyr Thr Asp He Met His Thr Leu Phe Leu Glu Leu 

1365 1370 1375 

Leu Glu Gin Tyr Val Val Ala Lys Asn Pro Lys Leu Met Leu Arg Arg 

1380 1385 1390 

Ser Glu Thr Val Val Glu Arg Met Leu Ser Asn Trp Met Ser He Cys 

1395 1400 1405 

Leu Tyr Gin Tyr Leu Lys Asp Ser Ala Gly Glu Pro Leu Tyr Lys Leu 

1410 1415 1420 

Phe Lys Ala He Lys His Gin Val Glu Lys Gly Pro Val Asp Ala Val 
1425 1430 1435 1440 

Gin Lys Lys Ala Lys Tyr Thr Leu Asn Asp Thr Gly Leu Leu Gly Asp 

1445 1450 1455 

Asp Val Glu Tyr Ala Pro Leu Thr Val Ser Val He Val Gin Asp Glu 

1460 1465 1470 

Gly Val Asp Ala He Pro Val Lys Val Leu Asn Cys Asp Thr He Ser 

1475 1480 1485 

Gin Val Lys Glu Lys He He Asp Gin Val Tyr Arg Gly Gin Pro Cys 

1490 1495 1500 

Ser Cys Trp Pro Arg Pro Asp Ser Val Val Leu Glu Trp Arg Pro Gly 
1505 1510 1515 1520 
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Ser Thr Ala Gin He Leu Ser Asp Leu Asp Leu Thr Ser Gin Arg Glu 

1525 1530 1535 

Gly Arg Trp Lys Arg Val Asn Thr Leu Met His Tyr Asn Val Arg Asp 

1540 1545 1550 

Gly Ala Thr Leu He Leu Ser Lys Val Gly Val Ser Gin Gin Pro Glu 

1555 1560 1565 

Asp Ser Gin Gin Asp Leu Pro Gly Glu Arg His Ala Leu Leu Glu Glu 

1570 1575 1580 

Glu Asn Arg Val Trp His Leu Val Arg Pro Thr Asp Glu Val Asp Glu 
1585 1590 1595 1600 

Gly Lys Ser Lys Arg Gly Ser Val Lys Glu Lys Glu Arg Thr Lys Ala 

1605 1610 1615 

He Thr Glu He Tyr Leu Thr Arg Leu Leu Ser Val Lys Gly Thr Leu 

1620 1625 1630 

Gin Gin Phe Val Asp Asn Phe Phe Gin Ser Val Leu Ala Pro Gly His 

1635 1640 1645 

Ala Val Pro Pro Ala Val Lys Tyr Phe Phe Asp Phe Leu Asp Glu Gin 

1650 1655 1660 

Ala Glu Lys His Asn He Gin Asp Glu Asp Thr He His He Trp Lys 
1665 1670 1675 1680 

Thr Asn Ser Leu Pro Leu Arg Phe Trp Val Asn lie Leu Lys Asn Pro 

1685 1690 1695 

His Phe He Phe Asp Val His Val His Glu Val Val Asp Ala Ser Leu 

1700 1705 1710 

Ser Val He Ala Gin Thr Phe Met Asp Ala Cys Thr Arg Thr Glu His 

1715 1720 1725 

Lys Leu Ser Arg Asd Ser Pro Ser Asn Lys Leu Leu Tyr Ala Lys Glu 

1730 1735 1740 

He Ser Thr Tyr Lys Lys Met Val Glu Asp Tyr Tyr Lys Gly He Arg 
1745 1750 1755 1760 

Gin Met Val Gin Val Ser Asp Gin Asp Met Asn Thr His Leu Ala Glu 

1765 1770 1775 

He Ser Arg Ala His Thr Asp Ser Leu Asn Thr Leu Val Ala Leu His 

1780 1785 1790 

Gin Leu Tyr Gin Tyr Thr Gin Lys Tyr Tyr Asp Glu He He Asn Ala 

1795 1800 1805 

Leu Glu Glu Asp Pro Ala Ala Gin Lys Met Gin Leu Ala Phe Arg Leu 
1810 1815 1820 



Gin Gin He Ala Ala Ala Leu Glu Asn Lys Val Thr Asp Leu 



1825 


1830 


1835 
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ccacttggca 


120 


ctggcacctg 


gccgaggcac 


actctatgtc 


ggcgcagtga 


accgcctctt 


ccagctcagc 


180 


cccgagctgc 


agctcgaggc 


cgtggctgtc 


actggccctg 


taatcgacag 


ccctgactgc 


240 


gtgcccttcc 


gtgacccagc 


cgagtgccca 


caggcccagc 


tcactgacaa 


tgccaaccag 


300 


ctgctgctgg 


tgagcagccg 


cgcccaggag 


ctggtggcct 


gcgggcaggt 


gcggcagggc 


360 


gtgtgtgaga 


cacggcgcct 


tggggatgtg 


gccgaggtgc 


tgtaccaggc 


tgaggaccct 


420 


ggtgacgggc 


agtttgtggc 
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ccgggagtgg 


caacggtggg 


gctggtggtg 


480 



- 7 - 



WO 01/14420 PCT/US00/23365 



cccttgcccg gccgggacct cctgcttgtg gccagaggcc tggcgggcaa gctgtcggca 540 

ggggtgccac ccctggccat ccgccagctg gccgggtctc agcccttctc cagcgagggc 600 

ctgggccgcc tggtggtggg cgacttctcc gactacaaca acagctacgt cggggccttt 660 

gccgacgccc gctccgccta cttcgtgttc cgccgccgcg gggcccgggc ccaggctgag 720 

taccgctcct acgtggcccg cgtctgcctg ggggacacca acctgtactc ctacgtggag 780 

gtccccctcg cctgccaggg ccagggcctc atccaggccg ccttccttgc cccgggcacc 840 

ttgctagggg tgtttgccgc gggcccaagg ggcacccagg cggcgctctg tgccttcccc 900 

atggtggagc tgggtgccag catggagcag gcccggagac tctgctacac ggcgggcggc 960 

cggggcccca gcggcgcaga ggaagccacc gtggagtacg gcgtcacgtc gcgctgcgtc 102 0 

accctgcccc ttgattcccc cgagtcgtac ccctgtggcg acgagcacac ccccagcccc 1080 

attgctggcc gccagcccct ggaggtccag cctctgctga agctcgggca gccggtcagc 1140 

gccgtggcag ctctccaggc agatgggcac atgatagcct tcctggggga cacccagggc 1200 

cagctgtaca aggtctttct ccacggctcc cagggccagg tttaccactc ccagcaagtg 1260 

gggcctccag gctcagccat cagcccagac ctgctgctgg acagcagtgg cagtcacctc 1320 

tatgtcctga ctgcccacca ggtggaccgg atacctgtgg cagcctgccc ccagttccct 1380 

gactgtgcca gctgccccca ggcccaggac ccgccgtgtg gctggtgtgt cctccagggc 1440 

aggtgtaccc ggaagggcca gtgcgggcgg gcaggccagc tgaaccagtg gctgtggagt 150 0 

tatgaggagg acagccactg cctgcacatc cagagcctgc tgccgggcca ccacccccgc 1560 

caggagcagg gccaggtcac tttgtctgtc ccccggctgc ccatcctgga tgcagatgaa 162 0 

tacttccatt gtgcgttcgg ggactatgac agcttggctc atgtggaagg gccccacgtg 1680 

gcctgtgtca cccctcccca agaccaggtg ccacttaacc ctccaggcac agaccacgtc 1740 

actgtgcccc tggccctgat gttcgaggac gtgactgtgg ctgccaccaa cttctccttt 1800 

tatgactgca gtgccgtcca ggccttggag gcggctgccc ccgtccttcc ccagggcctg 1860 

cctgcctcct tccactgctg gctggagctg cctggagaac ttcggggact gccggccacc 1920 

ctggaggaga cagcagggga ttcaggcctc atccactgcc aggcccacca gcgggagctc 1980 

ccagtgccca tctacgtcac ccagggtgaa gcccagaggc tggacaacac ccatgctctt 2 040 

tatggtgagc ctgagggcag ccaggcaggc ggggcagggt gggtggcaga caggaggcgc 2100 

tcagcacact gcctgacccc ccctagtgat cctgtacgac tgcgccatgg gccacccgga 2160 

ccgcagccac tgccaagcgg ccaacaggag cccgggctgc ctgtgaccag ccctgcccca 2220 

ggcccccaaa ccccagcagc tcggcctggc tgggctggtt cgctggccgg gcacccagca 2280 

ctgcagagtg gagcgtgggt gcgggggacc ccatctgcca tcatttgcct gctgcaggtc 2340 

gagcccctga ccggtccccc tgagggaggc ttggccctca ccatcctggg ctccaacctg 2400 

ggccgggcct tcgccgatgt gcagtacgcc gaccctgtcc tgctgagcct gagtcctcgc 2460 

tggggccccc aggcaggggg cacccagctc accatccgag gtcagcacct ccagacaggt 2 52 0 

ggcaacacca gtgccttcgt gggtggccaa ccctgtccca tgggtgggcg actgatccgt 2580 

gtcaggggca ccggcctaga cgtggtgcag cggcccctac tgtctgtgtg gctggaggct 2640 

gacgcagagg tgcaggcttc cagggcccag ccccaggacc cacagccaag gaggagctgt 2700 

ggagcccctg ctgcggaccc ccaggcttgt atccagctcg gtggggggct gctgcagcgc 2760 

acagcagagc ccagctcact ccacctgtgg tcggccctga atgccccaca gtgctccacc 2820 

gtctgctccg tcaactcgtc cagcctcctc ctgtgccgga gccctgctgt accagacaga 2880 

gcccacccgc agcgggtctt cttcacccta gacaacgtgc aagtggactt cgccagtgcc 2940 

agtgggggcc agggcttcct gtaccagccc aacccccgcc tggcacccct cagccgcgag 3 000 

gggcctgccc gcccctaccg cctcaagcca ggccatgtcc tggatgtgga gggcgagggc 3060 

ctcaacctgg gcatcagcaa ggaggaggtg cgcgtgcaca tcggccgcgg cgagtgcctg 312 0 

gtgaagacgc tcacgcgcac ccacctgtac tgcgagccgc ctgcgcacgc cccgcagcct 3180 

gccaatggct ccggcctgcc acagttcgtg gtgcagatgg gcaatgtgca gctggccctg 3240 

ggccctgtgc agtacgaggc tgaacccccg ctgtctgcct ttcccgtgga ggcccaggca 3300 

ggcgtgggca tgggtgctgc agtgctgatt gccgccgtgc tcctcctcac cctcatgtac 3360 

aggcacaaga gcaagcaggc cctgcgggac taccagaagg tgctagtgca gctggagagc 3420 

ctggagaccg gcgtgggaga ccagtgccgc aaggagttca cagacctcat gacggagatg 3480 

accgacctca gcagcgacct ggagggcagc gggatcccct tcctggacta ccgcacctac 3540 

gccgagcgcg ccttcttccc tggccatggc ggttgcccgc tgcagcccaa gcctgagggg 3600 

ccaggggagg acggccactg tgccactgtg cgccagggcc tcacgcagct ctccaacctg 3660 

ctcaacagca agctcttcct cctcacggtg agggccgtgt ggcgggagtg cccagtgggc 3720 

aaggaggtgg ggctggggaa ctactggcct gagacaaagg tgggggagga gacagagacc 3780 
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atggtggaga aactgctcac caactggctg tccatctgcc tgtacgcctt cctgagggag 3840 

gtggctggtg aaccactgta catgctcttc cgggccatcc agtaccaggt ggacaaaggc 3 900 

cccgtggacg ccgtgacagg caaggccaaa cggaccctga atgatagccg cttgctgcgg 3 960 

gaggacgtgg agttccagcc cctgacgctg atggtgctgg tggggcccgg ggctggcggg 402 0 

gccgcaggca gcagcgagat gcagcgcgtg ccagcccggg tgctcgacac ggacaccatc 4 080 

acccaggtca aggagaaggt gttggaccaa gtctacaagg gcaccccctt ctcccagagg 4140 

ccctcagtgc atgccctaga ccttggtgag agagccagcc ctgcccaccc accccaggga 4200 

cccttcccta cccctccggc acctggagcc cctcaactgt gtcttactat gaacataccc 4260 

acgctggagg atggcgagga ggggggggtg tgcctctggc acctggtgaa agccaccgag 4320 

gagccagaag gggccaaggt gcggtgcagc agcctgcggg agcgcgagcc agcaagggcc 4380 

aaggccattc cggaaatcta cctcacccgt ctgctgtcca tgaaggttgg tgcggcctgg 4440 

gtggctgggc ctgagaggag gctcagccag ggaccccgac cgagccaggg tgtgggaggg 4500 

gcaggggcag cctcagccgt ggatggcccc cacaccctgc cctccacaca gcccttatcc 4560 

cctgcctcgc agggcacgct gcagaagttt gtggacgaca ccttccaggc cattctcagc 4620 

gtgaaccggc ccatccccat cgccgtcaag tacctgtttg accttctgga tgagctagca 4680 

gagaagcacg gcatcgagga cccagggacc ctgcacatct ggaagaccaa cagtctgctg 4740 

ctgcggttct gggtgaatgc cttgaagaac ccacagctca tctttgatgt acgggtgtcg 4800 

gacaatgtgg acgccatcct tgctgtcatc gcccagacct tcattgactc ctgtaccacc 4860 

tcggagcata aagtgggccg ggtgagagca gtgccagcag cagcagctgg caggggcttg 4920 

aggaggaaag gcttatgggg gaagcctaga gggctgtgca cagagctctg ggtgggcagt 4980 

ggcagcatca tgggggcacc ttcacctccg agctcatgcc tagcgcctcc cctccctccg 5040 

gagcaggatt ccccagtgaa caaactgctc tacgcccggg agatcccacg ctacaagcag 5100 

atggtggaga ggtactatgc ggacattcgc cagagctctc cggcgagcta ccaggagatg 5160 

aactctgctt tggctgagct ctccgggaac tacacttctg ctccccactg tctggaggct 522 0 

ctgcaagaac tctacaacca catccacagg tactatgatc agattatcag tgccctggag 5280 

gaggaccctg tgggccagaa gctgcagctg gcctgccgcc tgcagcaggt cgccgccctg 5340 

gtggaaaaca aagtgactga cctgtga 5367 



<210> 4 

<211> 1788 

<212> PRT 

<213> HOMO SAPIEN 



<400> 4 



Met Ala 




Trp 


Pro 


Pro 


Phe 


Gly 


Leu 


Cys 


Leu 


Leu 






Leu 


Leu 


1 


5 










10 










15 




Ser Pro 


Pro 


Pro 


Leu 


Pro 


Leu 


Thr 


Gly 


Ala 


His 


Arg 


Phe 


Ser 


Ala 


Pro 






20 










25 










30 






Asn Thr 


Thr 


Leu 


Asn 


His 


Leu 


Ala 


Leu 


Ala 


Pro 


Gly 


Arg 


Gly 


Thr 


Leu 




35 










40 










45 








Tyr Val 


Gly 


Ala 


Val 


Asn 


Arg 


Leu 


Phe 


Gin 


Leu 


Ser 


Pro 


Glu 


Leu 


Gin 


50 








55 










60 










Leu Glu 


Ala 


Val 


Ala 


Val 


Thr 


Gly 


Pro 


Val 


He 


Asp 


Ser 


Pro 


Asp 


Cys 


65 








70 










75 










80 


Val Pro 


Phe 


Arg 


Asd 


Pro 


Ala 


Glu 


Cys 


Pro 


Gin 


Ala 


Gin 


Leu 


Thr 


Asp 






85 










90 










95 




Asn Ala 


Asn 


Gin 


Leu 


Leu 


Leu 


Val 


Ser 


Ser 


Arg 


Ala 


Gin 


Glu 


Leu 


Val 






100 










105 










110 






Ala Cys 


Gly 


Gin 


Val 


Arg 


Gin 


Gly 


Val 


Cys 


Glu 


Thr 


Arg 


Arg 


Leu 


Gly 


115 










120 










125 








Asp Val 


Ala 


Glu 


Val 


Leu 


Tyr 


Gin 


Ala 


Glu 


Asp 


Pro 


Gly 


Asp 


Gly 


Gin 


130 










135 










140 










Phe Val 


Ala 


Ala 


Asn 


Thr 


Pro 


Gly 


Val 


Ala 


Thr 


Val 


Gly 


Leu 


Val 


Val 
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145 








150 






155 








160 


Pro Leu 


Pro Gly Arg 


Asp Leu Leu 




Val Ala Arg 








Gly 








165 








170 






175 




Lys Leu Ser Ala Gly Val 


Pro Pro 


Leu 


Ala He Arg 


Gin 


Leu 


Ala 


Gly 






180 








185 






190 






Ser Gin 


Pro 
195 


Phe 


Ser 


Ser 


Glu Gly 
200 




Gly Arg Leu 


205 








Phe Ser 


Asp 


Tyr 


Asn 


Asn 


Ser Tyr 


Val 


Gly Ala Phe 


Ala 


Asp 






210 






215 




220 










Ser Ala 


Tyr 


Phe 


Val 


Phe 


Arg A.rg 


Arg 


Gly Ala Arg 


Ala 


Gin 


Ala 


Glu 


225 






230 






235 








240 


Tyr Arg 


Ser 


Tyr 


Val 


Ala 


Arg Val 


Cys 


Leu Gly Asp 


Thr 


Asn 




Tyr 






245 








250 






255 




Ser Tyr 


Val 


Glu 


Val 


Pro 


Leu Ala 


Cys 


Gin Gly Gin 


Gly Leu 


lie 


Gin 




260 








265 






270 






Ala Ala 


Phe 
275 


Leu 


Ala 


Pro 


Gly Thr 
280 


Leu 


Leu Gly Val 


Phe 
285 


Ala 


Ala 


Gly 


Pro Arg 


Gly 


Thr 


Gin 


Ala 


Ala Leu 


Cys 


Ala Phe Pro 


Met 


Val 


Glu 


Leu 


290 








295 




300 










Gly Ala 


Ser 


Met 


Glu 


Gin 


Ala Arg 




Leu Cys Tyr 


Thr 


Ala 






305 








310 






315 








j20 


Arg Gly 


Pro 


Ser 


Gly 


Ala 


Glu Glu 


Ala 


Thr Val Glu 


Tyr 


Gly 


Val 


Thr 






325 








330 






335 




Ser Arg 


Cys 


Val 


Thr 


Leu 


Pro Leu 


Asp 


Ser Pro Glu 


Ser 


Tyr 


Pro 


Cys 


340 








345 






350 






Gly Asp Glu His Thr 


Pro 


Ser Pro 


He 


Ala Gly Arg 


Gin 


Pro 


Leu 


Glu 




3 55 








360 






365 








Val Gin 


Pro 


Leu 


Leu 


Lys 


Leu Gly 


Gin 


Pro Val Ser 


Ala 


val 


Ala 


Ala 


370 










375 




380 










Leu Gin Ala Asp Gly His Met He 


Ala 


Phe Leu Gly 


Asp 


Thr 


Gin 


Gly 


385 








390 






395 








400 


Gin Leu Tyr Lys 


Val 


Phe 


Leu His 


Gly 


Ser Gin Gly 


Gin 


Val 


Tyr 


His 








405 








410 






415 




Ser Gin 


Gin Val 


Gly 


Pro 


Pro Gly 


Ser 


Ala He Ser 


Pro 


Asa 


Leu 


Leu 






420 






42 5 






43 0 






Leu Asp 


Ser 


Ser Gly 


Ser 


His Leu 




Val Leu Thr 


Ala 


His 






435 








440 






445 








Asp Arg 


He 


Pro 


Val 


Ala Ala Cys 


Pro 


Gin Phe Pro 


Asp 


Cys 


Ala 


Ser 


450 










455 




460 










Cys Leu Gin Ala Gin 


ASD 


Pro Leu 


Cys 


Gly Trp Cys 


Val 


Leu 


Gin 




465 








47 0 






475 








d fin 


Arg Cys Thr Arg 


Lys 


Gly Gin Cys 


Gly 


Arg Ala Gly 


Gin 


Leu 




Gin 








485 








490 






495 




Trp Leu 


Trp 


Ser 


Tyr 


Glu Glu Asp 


Ser 


His Cys Leu 


His 


He 


Gin 


Ser 


500 






505 






510 






Leu Leu 


Pro Gly His 


His 


Pro Arg 


Gin 


Glu Gin Gly 


Gin 


Val 


Thr 


Leu 




515 








520 






525 








Ser Val 


Pro Arg 


Leu 


Pro 


He Leu 


Asp 


Ala Asp Glu 


Tyr 


Phe 


His 


Cys 


530 










535 














Ala Phe Gly Asp Tyr Asp 


Ser Leu 


Ala 


His Val Glu 


Gly 


Pro 


His 


Val 


545 








550 






555 








560 


Ala Cys 


Val 


Thr 


Pro 


Pro Gin Asp 


Gin 


Val Pro Leu 


Asn 


Pro 


Pro 


Gly 






565 








570 






575 




Thr Asp 


His 


Val 


Thr 


Val 


Pro Leu 


Ala 


Leu Met Phe 


Glu Asp 


Val 


Thr 




580 








585 






590 
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V 1 

a 


Al Al 


Th 

r 


sn 


Ph 

e 


er 


Ph 


Tv 

yr 


sp 


ys 


er 


Al 


V 1 


n 


Al - 














n 


















eu 


, . 


Al 


Al 




V 1 


eu 




n 


CI 

y 




ro 


Al 


er 


Ph 




610 3 








615 










620 










. 


ys ^ 


eu 


Gl 

U 


j. 


ro 


Gl 

y 


Gl 

U 


eu 




" y 


eu 


ro 


Al 
. a 


Th 


695 








630 










635 












eu 


u 


r 


Al 


ri 

y 


sp 


er 


y 




e 


is 


ys 


m 


AT 


IS 








645 




















6 55 




ri 

n 


a fi- 
rs ^ J 




p 


Val 


Pro 


e 


Tv 


ai 




Gl 

n 


ri 

y 


Cl 


Al - 


ri 






660 




























A 


A n 


Thr 


His 


^1 




Tv 
yr 


ri 

y 


Gl 

U 


Pro 


Cl 


~ y 


5er 


n 




SU 675 










680 










685 








Al 


Gly Gly 


Al 

3 


Glv 
y 




V 1 


Al 

a 


-sp 




A a 
^rg 




er 


Al 


is 


ys 












695 










700 
















Ser 




Pr 


V 1 




L u 




H . 


Gl 


Pro 




ri 
































eu 


n ro 


eu 




er 




ri 


- 

n 


ri 


ro 


y 


eu 


ro 


V 1 










725 










73 0 










735 




er 


ro a 




y 


Pr 


ri 


Th 

r 




a 


Al 


rg 


I 0 


Cl 


rp 








740 










745 










7 SO 






y 


T L 




Glv 

y 


H ' s 
1 




Ala 


ijeu 


Gin 


S 


Gl 

^ y 


A 1 


T 

rP 


V 1 
a 


rg 














7 60 










765 








y 


Th 

77n r ° 


er 


Al 


lie 


He 


y 


eu 


Le 
eu 


Gl 

n 


V 1 


Gl 

U 


ro 


eu 


Th 

r 












7 7 S 










780 












Pr P 


Glu 


ri 

y 


ri 


Leu 


Ala 


Leu 


Thr 


He 


Leu 


y 


c er 


Asn 


Leu 


785 








790 










795 










800 


Gl 


Ara Ala 


Phe 


Ala 


- 

P 


Val 


Gin 


Tvr 
y 


Ala 


P 


Pro 


Val 


Leu 
u 




er 




























815 




eu 


Ser Pro 




rp 


Gl 

y 


Pro 


ri 

n 


Al 


y 


Gl 

y 


Thr 


Gin 




Th 

r 


.. 

e 






820 










82 5 










830 






rg 


ri ri 


is 


eu 


Gl 

n 


Th 

r 


' y 


PI ■ 
y 


sn 


TV, 


r 


Al 


ne 


V 1 

vai 


n 

y 
































y 




ys 


ro 


6 




y 


rg 




6 






rg 


y 


r 












855 










8*60 












ASP 


v 1 
a 


Val 


Gl 


Arg 




eu 


eu 




1 


rp 


eu 


n 


Al 








R7n 








875 








flp n 


sp 


Al Cl 

a u 


V 1 

a 


Gl 


Al 


er 


rg 


Ala 


ri 


Pro 


n 


sp 


ro 




p 

ro 








885 










890 














r 9 


Ara Ser 


_ 


y 


Ale 




Al 


Al' 


A 

P 


Pr 


Gl 


Al 






Gl 

n 






900 










qns 
















Leu 
U 


Gly Gly 


n 

y 


Leu 




Gin 




Thr 


Ala 


Glu 


Pro 


Scr 


er 


eu 


His 














920 










925 








eu 




Ala 


eu 


sn 


Al 




Gin 


y 


Ser 


^hr 


V 1 


ys 


er 


VI 












93 5 










940 
















eu 




T 

U 


Cys 


Arg 


Ser 
er 


p 


Al 

. a 


Val 


Pr- 0 


Asp 






















055 














His Pro 


cl 

n 




vai 


Phe 


Phe 


Thr 




sp 


Asv 


Val 


Gin 


V 1 


A 

Asp 








965 










97 0 










075 


Phe 


Ala Ser 


Ala 


Ser 


Gly 


Gly 


Gin 


Gly 


Phe 


Leu 


Tyr 


Gin 


Pro 


Asn 


Pro 






980 










985 










990 








Leu Ala 


Pro 


Leu 


Ser 


Arg 


Glu 


Gly 


Pro 


Ala 


Arg 


Pro 


Tyr 


Arg 


Leu 




995 










1000 








1005 






Lys 


Pro Gly 


His 


Val 


Leu 


Asp 


Val 


Glu 


Gly 


Glu 


Gly 


Leu 


Asn 




Gly 




1010 








1015 








1020 








lie 


Ser Lys 


Glu 


Glu 


Val 


Arg 


Val 


His 


He 


Gly 


Arg Gly Glu 


Cys 


Leu 
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1025 1030 1035 1040 

Val Lys Thr Leu Thr Arg Thr His Leu Tyr Cys Glu Pro Pro Ala His 

1045 1050 1055 

Ala Pro Gin Pro Ala Asn Gly Ser Gly Leu Pro Gin Phe Val Val Gin 

1060 1065 1070 

Met Gly Asn Val Gin Leu Ala Leu Gly Pro Val Gin Tyr Glu Ala Glu 

1075 1080 1085 

Pro Pro Leu Ser Ala Phe Pro Val Glu Ala Gin Ala Gly Val Gly Met 

1090 1095 1100 

Gly Ala Ala Val Leu He Ala Ala Val Leu Leu Leu Thr Leu Met Tyr 
1105 1110 1115 1120 

Arg His Lys Ser Lys Gin Ala Leu Arg Asp Tyr Gin Lys Val Leu Val 

1125 1130 1135 

Gin Leu Glu Ser Leu Glu Thr Gly Val Gly Asp Gin Cys Arg Lys Glu 

1140 1145 1150 

Phe Thr Asp Leu Met Thr Glu Met Thr Asp Leu Ser Ser Asp Leu Glu 

1155 1160 1165 

Gly Ser Gly He Pro Phe Leu Asp Tyr Arg Thr Tyr Ala Glu Arg Ala 

1170 1175 1180 

Phe Phe Pro Gly His Gly Gly Cys Pro Leu Gin Pro Lys Pro Glu Gly 
1185 1190 1195 1200 

Pro Gly Glu Asp Gly His Cys Ala Thr Val Arg Gin Gly Leu Thr Gin 

1205 1210 1215 

Leu Ser Asn Leu Leu Asn Ser Lys Leu Phe Leu Leu Thr Val Arg Ala 

1220 1225 1230 

Val Trp Arg Glu Cys Pro Val Gly Lys Glu Val Gly Leu Gly Asn Tyr 

1235 1240 1245 

Trp Pro Glu Thr Lys Val Gly Glu Glu Thr Glu Thr Met Val Glu Lys 

1250 1255 1260 

Leu Leu Thr Asn Trp Leu Ser He Cys Leu Tyr Ala Phe Leu Arg Glu 
1265 1270 1275 1280 

Val Ala Gly Glu Pro Leu Tyr Met Leu Phe Arg Ala He Gin Tyr Gin 

1285 1290 1295 

Val Asp Lys Gly Pro Val Asp Ala Val Thr Gly Lys Ala Lys Arg Thr 

1300 1305 1310 

Leu Asn Asp Ser Arg Leu Leu Arg Glu Asp Val Glu Phe Gin Pro Leu 

1315 1320 1325 

Thr Leu Met Val Leu Val Gly Pro Gly Ala Gly Gly Ala Ala Gly Ser 

1330 1335 1340 

Ser Glu Met Gin Arg Val Pro Ala Arg Val Leu Asp Thr Asp Thr He 
1345 1350 1355 1360 

Thr Gin Val Lys Glu Lys Val Leu Asp Gin Val Tyr Lys Gly Thr Pro 

1365 1370 1375 

Phe Ser Gin Arg Pro Ser Val His Ala Leu Asp Leu Gly Glu Arg Ala 

1380 1385 1390 

Ser Pro Ala His Pro Pro Gin Gly Pro Phe Pro Thr Pro Pro Ala Pro 

1395 1400 1405 

Gly Ala Pro Gin Leu Cys Leu Thr Met Asn He Pro Thr Leu Glu Asp 

1410 1415 1420 

Gly Glu Glu Gly Gly Val Cys Leu Trp His Leu Val Lys Ala Thr Glu 
1425 1430 1435 1440 

Glu Pro Glu Gly Ala Lys Val Arg Cys Ser Ser Leu Arg Glu Arg Glu 

1445 1450 1455 

Pro Ala Arg Ala Lys Ala He Pro Glu He Tyr Leu Thr Arg Leu Leu 
1460 1465 1470 
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Ser Met Lys Val Gly Ala Ala Trp Val Ala Gly Pro Glu Arg Arg Leu 

1475 1480 1485 

Ser Gin Gly Pro Arg Pro Ser Gin Gly Val Gly Gly Ala Gly Ala Ala 

1490 1495 1500 

Ser Ala Val Asp Gly Pro His Thr Leu Pro Ser Thr Gin Pro Leu Ser 
1505 1510 1515 1520 

Pro Ala Ser Gin Gly Thr Leu Gin Lys Phe Val Asp Asp Thr Phe Gin 

1525 1530 1535 

Ala lie Leu Ser Val Asn Arg Pro lie Pro lie Ala Val Lys Tyr Leu 

1540 1545 1550 

Phe Asp Leu Leu Asp Glu Leu Ala Glu Lys His Gly lie Glu Asp Pro 

1555 1560 1565 

Gly Thr Leu His lie Trp Lys Thr Asn Ser Leu Leu Leu Arg Phe Trp 

1570 1575 1580 

Val Asn Ala Leu Lys Asn Pro Gin Leu lie Phe Asp Val Arg Val Ser 
1585 1590 1595 1600 

Asp Asn Val Asp Ala He Leu Ala Val He Ala Gin Thr Phe He Asp 

1605 1610 1615 

Ser Cys Thr Thr Ser Glu His Lys Val Gly Arg Val Arg Ala Val Pro 

1620 1625 1630 

Ala Ala Ala Ala Gly Arg Gly Leu Arg Arg Lys Gly Leu Trp Gly Lys 

1635 1640 1645 

Pro Arg Gly Leu Cys Thr Glu Leu Trp Val Gly Ser Gly Ser He Met 

1650 1655 1660 

Gly Ala Pro Ser Pro Pro Ser Ser Cys Leu Ala Pro Pro Leu Pro Pro 
1665 1670 1675 1680 

Glu Gin Asp Ser Pro Val Asn Lys Leu Leu Tyr Ala Arg Glu He Pro 

1685 1690 1695 

Arg Tyr Lys Gin Met Val Glu Arg Tyr Tyr Ala Asp He Arg Gin Ser 

1700 1705 1710 

Ser Pro Ala Ser Tyr Gin Glu Met Asn Ser Ala Leu Ala Glu Leu Ser 

1715 1720 1725 

Gly Asn Tyr Thr Ser Ala Pro His Cys Leu Glu Ala Leu Gin Glu Leu 

1730 1735 1740 

Tyr Asn His He His Arg Tyr Tyr Asp Gin He He Ser Ala Leu Glu 
1745 1750 1755 1760 

Glu Asp Pro Val Gly Gin Lys Leu Gin Leu Ala Cys Arg Leu Gin Gin 

1765 1770 1775 

Val Ala Ala Leu Val Glu Asn Lys Val Thr Asp Leu 
1780 1785 



<210> 5 

<211> 5892 

<212> DNA 

<213> HOMO SAPIEN 



<400> 5 

gcgcacgccc ggatggctct tcgcgccgcg ggcggcgcac cctttagcgg cccggccgcc 60 

gctgccagcc ccccgccgtt ccagacgccg ccgcggtgcc cggtgccgct gctgttgctg 120 

ctgctcctgg gggcggcgcg ggccggcgcc ctggagatcc agcgtcggtt cccctcgccc 180 

acgcccacca acaacttcgc cctggacggc gcggcgggga ccgtgtacct ggcggccgtc 240 

aaccgcctct atcagctgtc gggcgccaac ctgagcctgg aggccgaggc ggccgtgggc 300 

ccggtgcccg acagcccgct gtgtcacgct ccgcagctgc cgcaggcctc gtgcgagcac 360 

ccgcggcgcc tcacggacaa ctacaacaag atcctgcagc tggaccccgg ccagggcctg 42 0 

gtagtcgtgt gcgggtccat ctaccagggc ttctgccagc tgcggcgccg gggtaacatc 480 
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tcggccgtgg ccgtgcgctt cccgcccgcc gcgccgcccg ccgagcccgt cacggtgttc 540 

cccagcatgc tgaacgtggc ggccaaccac ccgaacgcgt ccaccgtggg gctagttctg 600 

ccccccgccg cgggcgcggg gggcagccgc ctgctcgtgg gcgccacgta caccggttac 660 

ggcagctcct tcttcccgcg caaccgcagc ctggaggacc accgcttcga gaacacgccc 720 

gagatcgcca tccgctccct ggacacgcgc ggcgacctgg ccaagctctt caccttcgac 780 

ctcaacccct ccgacgacaa catcctcaag atcaagcagg gcgccaagga gcagcacaag 840 

ctgggcttcg tgagcgcctt cctgcacccg tccgacccgc cgccgggtgc acagtcctac 900 

gcgcacctgg cgctcaacag cgaggcgcgc gcgggcgaca aggagagcca ggcgcggagc 960 

ctgctggcgc gcatctgcct gccccacggc gccggcggcg acgccaagaa gctcaccgag 1020 

tccracatcc agttgggctt gcagtgcgcg ggcggcgcgg gccgcggcga cctctacagc 1080 

cgcctggtgt cggtcttccc agcccgggag cggctctttg ctgtcttcga gcggccccag 1140 

gggccccccg cggcccgcgc tgctccggcc gcactctgcg ccttccgctt cgccgacgtg 1200 

cgagccgcca tccgagctgc gcgcaccgcc tgcttcgtgg aaccggcgcc cgacgtggtg 12 6 0 

gcggtgctcg acagcgtggt gcagggcacg ggaccggcct gcgagcgcaa gctcaacatc 1320 

cacctccagc cagagcagct ggactgtgga gctgctcacc tgcagcaccc gctgtccatc 1380 

ctgcagcccc tgaaggccac gcccgtgttc cgcgccccgg gcctcacctc cgtggccgtg 1440 

gccagcgtca acaactacac agcggtcttc ctgggcacgg tcaacgggag gcttctcaag 1500 

atcaacctga acgagagcat gcaggtggtg agcaggcggg tggtgactgt ggcctatggg 1560 

gagcccgtgc accatgtcat gcagtttgac ccagcagact ccggttacct ttacctgatg 1620 

acgtcccacc agatggccag ggtgaaggtc gccgcctgca acgtgcactc cacctgtggg 1680 

gacrgcgtgg gtgcggcgga cgcctactgc ggctggtgtg ccctggagac gcggtgcacc 1740 

ttgcagcagg actgcaccaa ttccagccag cagcatttct ggaccagtgc cagcgagggc 1800 

cccagccgct gtcctgccat gaccgtcctg ccttccgaga tcgatgtgcg ccaggagtac 1860 

ccaggcatga tcctgcagat ctcgggcagc ctgcccagcc tcagtggcat ggagatggcc 1920 

tg-gactatg ggaacaacat ccgcactgtg gctcgggtcc caggccctgc ctttggtcac 1980 

cagattgcct actgcaacct cctgccgagg gaccagtttc cgcccttccc ccccaaccag 2040 

gaccacgtga ctgttgagat gtccgtgagg gtcaatgggc ggaacatcgt caaggccaat 2100 

ttcaccatct acgactgcag ccgcactgca caagtgtacc cccacacagc ctgtaccagc 2160 

tgcctgtcgg cacagtggcc ctgtttctgg tgcagccagc agcactcctg tgtttccaac 2220 

cagcctcggt gcgaggcctc accaaacccc acgagccctc aggactgccc ccggaccctg 2280 

ctcrcacccc tggcacccgt gcctaccggt ggctcccaga acatcctggt gcctctggcc 2340 

aacactgccn ttttccaggg tgcagccctg gagtgtagtt ttgggctgga ggagatcttc 2400 

gaggctgtgt gggtgaatga gtctgttgta cgctgtgacc aggtggtgct gcacacgacc 2460 

cggaagagcc aggtgttccc gctcagcctc caactaaagg ggcggccagc ccgattcctg 2520 

gacagccctg agcccatgac agtcatggtc tataactgtg ccatgggcag ccccgactgt 2580 

tcccagtgcc tgggccgcga agacctgggt cacctgtgcg tgtggagtga tggctgccgc 2640 

ctgcgggggc ctctgcagcc catggctggc acctgccccg cccccgagat ccgcgcgatt 2700 

gagcccctga gtggcccgtt ggacggtggg accctgctga ccatccgagg aaggaacctg 2760 

ggccggcggc tcagtgacgt ggcccacggc gtgtggattg gtggtgtggc ctgtgagcca 2820 

ctgcctgaca gatacacggt gtcggaggag atcgtgtgtg tcacagggcc agccccagga 2 88 0 

ccgctctcag gtgtggtgac cgtgaacgcc tctaaggagg gcaagtcccg ggaccgcttc 2940 

tccracgtgc tgcccctggt ccactccctg gagcctacca tgggccccaa ggccgggggc 3 000 

accaggatca ccatccatgg gaatgacctc catgtaggct ccgagctcca ggtcctggtg 3060 

aacgacacag acccctgcac ggagctgatg cgcacagata ccagcatcgc ctgcaccatg 312 0 

cccgaggggg ccctgccggc tccggtgcct gtgtgtgtgc gcttcgagcg tcggggctgc 3180 

gtgcacggca acctcacctt ctggtacatg cagaacccgg tcatcacggc catcagtccc 3240 

cgccgcagcc ctgtcagtgg cggcaggacc atcacagtgg ctggtgagcg tttccacatg 3300 

gtgcagaatg tgtccatggc cgtccaccac attggccggg agcccacgct ctgcaaggtt 3360 

ctcaactcca ccctcatcac ctgcccgtcc cccggggccc tgagcaacgc atcagcgcca 3420 

gtggacttct tcatcaatgg gcgggcctac gcagacgagg tggctgtggc tgaggagcta 3480 

ctggaccccg aggaggcaca gcggggcagc aggttccgcc tggactacct ccccaaccca 3540 

cagrtctcta cggccaagag ggagaagtgg atcaagcacc accccgggga gcctctcacc 3 600 

ctcgttatcc acaaggagca ggacagcctg gggctccaga gtcacgagta ccgggtcaag 3 660 

ataggccaag taagctgcga catccagatt gtctctgaca gaatcatcca ctgctcggtc 3720 

aacgagtccc tgggcgcggc cgtggggcag ctgcccatca caatccaggt agggaacttc 3780 
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aaccagacca tcgccacact gcagctgggg ggcagcgaga cggccatcat cgtgtccatc 3 840 

gtcatctgca gcgtcctgct gctgctctcc gtggtggccc tgttcgtctt ctgtaccaag 3900 

agccgacgtg ctgagcgtta ctggcagaag acgctgctgc agatggagga gatggaatct 3960 

cagatccgag aggaaatccg caaaggcttc gctgagctgc agacagacat gacagatctg 4020 

accaaggagc tgaaccgcag ccagggcatc cccttcctgg agtataagca cttcgtgacc 4080 

cgcaccttct tccccaagtg ttcctccctt tatgaagagc gttacgtgct gccctcccag 4140 

accctcaact cccagggcag ctcccaggca caggaaaccc acccactgct gggagagtgg 4200 

aagattcctg agagctgccg gcccaacatg gaagagggaa ttagcgtgtt ctcctcacta 4260 

ctcaacaaca agcacttcct catcgtcttt gcccacgcgc tggagcagca gaaggacttt 4320 

gcggtgcgcg acaggtgcag cctggcctcg ctgctgacca tcgcgctgca cggcaagctg 4380 

gagtactaca ccagcatcat gaaggagctg ctggtggacc tcattgacgc ctcggccgcc 4440 

aagaacccca agctcatgct gcggcgcaca gagtctgtgg tggagaagat gctcaccaac 4500 

tggatgtcca tctgcatgta cagctgtctg cgggagacgg tgggggagcc attcttcctg 4560 

ccgctgtgtg ccatcaagca gcaaatcaac aagggctcca tcgacgccat cacaggcaag 4620 

gcccgctaca cactcaatga ggagtggctg ctgcgggaga acatcgaggc caagccccgg 4680 

aacctgaacg tgtccttcca gggctgtggc atggactcgc tgagcgtgcg ggccatggac 4740 

accgacacgc tgacacaggt caaggagaag accctggagg ccttctgcaa gaatgtgccc 4800 

tactcccagt ggccgcgtgc agaggacgtc gaccttgagt ggttcgcctc cagcacacag 4860 

agctacatcc ttcgggacct ggacgacacc tcagtggtgg aagacggccg caagaagctt 4920 

aacacgctgg cccattacaa gatccctgaa ggtgcctccc tggccatgag tctcatagac 4980 

aagaaggaca acacactggg ccgagtgaaa gacttggaca cagagaagta tttccatttg 5040 

gtgctgccta cggacgagct ggcggagccc aagaagtctc accggcagag ccatcgcaag 5100 

aaggtgctcc cggaaatcta cctgacccgc ctgctctcca ccaagggcac gttgcagaag 5160 

tttctggatg acctgttcaa ggccattctg agtatccgtg aagacaagcc cccactggct 5220 

gtcaagtact ttttcgactt cctggaggag caggctgaga agaggggaat ctccgacccc 5280 

gacaccctac acatctggaa gaccaacagc cttcctcccc ggttctgggt gaacatcctg 5340 

aagaaccccc agtttgtctt tgacatcgac aagacagacc acatcgacgc ctgcctttca 5400 

gtcatcgcgc aggccttcat cgacgcctgc tccatctctg acctgcagct gggcaaggat 5460 

tcgccaacca acaagctcct ctacgccaag gagattcctg agtaccggaa gatcgtgcag 5520 

cgctactaca agcagatcca ggacatgacg ccgctcagcg agcaagagat gaatgcccat 558 0 

ctggccgagg agtcgaggaa ataccagaat gagttcaaca ccaatgtggc catggcagag 5640 

atttttaggt cgcccaagag gtatcggccg cagatcatgg ccgcgctgga ggccaacccc 5700 

acggcccgga ggacacaact gcagcacaag tttgagcagg tggtggcttt gatggaggac 5760 

aacatctacg agtgctacag tgaggcctga gacacatgga gagttggtca ggctgctgct 5820 

gggagaaatg gacgcccact gggcctcaac ttgatcttct accccgtgcc tgtgactcag 5880 

actgggaaat ac 5892 
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65 
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Gin 


Leu 


Ser 


Gly 


Ala 


Asn 


Leu 


Ser 


Pro 


Val 


Pro 


Asp 


85 
Ser 


Pro 


Leu 


Cys 








100 










Ser 


Cys 


Glu 


His 


Pro 


Arg 


Arg 


Leu 






115 










12 0 


Gin 


Leu 


Asp 


Pro 


Gly 


Gin 


Gly 


Leu 




130 










135 




Gin 


Gly 


Phe 


Cys 


Gin 


Leu 


Arg 


Arg 


145 










150 






Val 


Arg 


Phe 


Pro 


Pro 


Ala 


Ala 


Pro 










165 








Pro 


Ser 


Met 


Leu 


Asn 


Val 


Ala 


Ala 








180 










Gly 


Leu 


Val 


Leu 


Pro 


Pro 


Ala 


Ala 






195 










200 


Val 


Gly Ala 


Thr 


Tyr 


Thr 


Gly 


Tyr 
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215 




Arg 
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Leu 


Clu 


Asp 


His 


Arg 


Phe 


225 










230 






Arg 


Ser 


Leu 


Asp 


Thr 


Arg 


Gly 


Asp 










245 








Leu 


Asn 


Pro 


Ser 


Asp 


Asp 


Asn 


He 








260 










Glu 


Gin 


His 


Lys 


Leu 


Gly 


Phe 


Val 
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Pro 


Pro 


Pro 


Gly 


Ala 


Gin 


Ser 


Tyr 




290 










295 




Ala 


Arg Ala 


Gly 


Asp 


Lys 


Glu 


Ser 


305 
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lie 


Cys 


Leu 


Pro 


His 


Gly 


Ala 


Gly 










325 








Ser Tyr 


He 


Gin 


Leu 


Gly 


Leu 


Gin 








340 










Asp 


Leu 


Tyr 


Ser 


Arg 


Leu 


Val 


Ser 






355 










360 


Phe 


Ala 


Val 


Phe 


Glu 


Arg 


Pro 


Gin 




370 
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Pro 


Ala 


Ala 
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Cys 


Ala 


Phe 


Arg 
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Arg 


Ala 


Ala 


Arg 
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Ala 
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Val 
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Val 


Val 
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Lys 


Leu 
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He 


Gin 


Leu 


Glr. 


Pro 






435 










440 


His 


Leu 


Gin 


His 


Pro 


Leu 


Ser 


He 




450 










455 




Val 


Phe Arg 


Ala 


Pro 


Gly 




Thr 


465 
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Asn Tyr Thr 


Ala 


Val 


Phe 
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Gly 
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Val 
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Glu 
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Glu 
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Val 


Gly 




90 
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Ala 


Pro 
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Leu 
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Gin 


Ala 
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Thr 


Asp 


Asn 
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Asn 
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He 


Leu 










125 








Val 


Val 


Val 
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Gly 
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He 
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Arg 


Gly 
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He 
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Val 


Ala 
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Pro 


Ala 


Glu 


Pro 


Val 
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Val 


Phe 
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Asn 
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Ala 
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Thr 


Val 
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Gly 


Ala 


Gly 


Gly 
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Arg 
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Ser 
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Phe 
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Glu 


Asn 


Thr 


Pro 


Glu 


He 


Ala 


He 






235 










240 


Leu 


Ala 


Lys 


Leu 


Phe 


Thr 


Phe 
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Cys 


Thr 
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Leu Gin 


Gin 


Asp 
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Cys 


Thr 


Asn 


Ser 


Ser 
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Gin 


Gin 


His 


Phe 


Trp 
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Thr 


Ser 


Ala Ser 


Glu 
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Gly 


Pro 


Ser 


Arg 


Cys 
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Pro 


Ala 


Met 


Thr 


Val 
505 


Leu 


Pro 


Ser 


Glu He 


Asp 


Val 


Arg 


Gin 


Glu 


Tyr 


Pro Gly 


Met 


He 


Leu 


Gin 


He 


Ser 
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615 










620 










Gly Ser 


Leu 


Pro 


Ser 


Leu 


Ser 


Gly 


Met 


Glu 


Met 


Ala 


Cys 


Asp 


Tyr 


Gly 


625 
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635 
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Asn Asn 


He 


Arg 


Thr 
645 


Val 


Ala 


Arg 


Val 


Pro 
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Gly 


Pro 


Ala 


Phe 


Gly 
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His 


Gin He 


Ala 


Tyr 
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Cys 


Asn 


Leu 


Leu 


Pro 
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Arg 


Asp 


Gin 


Phe 


Pro 
670 


Pro 


Phe 


Pro Pro 
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Gin 


Asp 


His 


Val 


Thr 
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Val 


Glu 


Met 


Ser 


Val 
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Arg 


Val 


Asn 


Gly Arg 


Asn 


He 


Val 


Lys 


Ala 


Asn 


Phe 


Thr 


He 


Tyr 


Asp 


Cys 


Ser 


Arg 
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Thr Ala 


Gin 


Val 
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Thr 
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Thr 


Ser 
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Trp Cys 


Ser 
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Ser 
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Val 


Ser 
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Gin Ser 


Arg 


Cys 
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Glu 


Ala 


Ser 


Pro 


Asn 
745 


Pro 


Thr 


Ser 


Pro 
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Cys 


Pro Arg 


Thr 
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Leu 


Ser 


Pro 


Leu 


Ala 


Pro 


Val 


Pro 


Thr 


Gly 


Gly 


Ser 


755 
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Gin Asn 


He 


Leu 


Val 


Pro 


Leu 


Ala 


Asn 


Thr 


Ala 


Phe 


Phe 


Gin 


Gly 


Ala 
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Ala Leu 


Glu 
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Ser 


Phe 


Gly 


Leu 


Glu 


Glu 
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Phe 


Glu 


Ala 


Val 


Trp 
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Val Asn 


Glu 


Ser 


Val 


Val 


Arg Cys 


Asp 


Gin 


Val 


Val 


Leu 


His 


Thr 


Thr 
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815 




Arg Lys 


Ser 


Gin 


Val 


Phe 


Pro 


Leu 


Ser 


Leu 


Gin 


Leu 


Lys 


Gly 


Arg 


Pro 




820 










825 










830 






Ala Arg 


Phe 


Leu Asp 


Ser 


Pro 


Glu 


Pro 


Met 


Thr 


Val 


Met 


Val 


Tyr 


Asn 


835 










840 










845 








Cys Ala 


Met 


Gly 


Ser 


Pro Asp Cys 


Ser 


Gin 


Cys 


Leu 


Gly 


Arg 


Glu 


Asp 


850 
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Leu Gly 


His 


Leu 


Cys 


Val Trp 


Ser 


Asp Gly 


Cys 


Arg 


Leu 


Arg 


Gly 


Pro 


865 
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Pro 


Met 


Ala 


Gly Thr Cys 


Pro 


Ala 


Pro 


Glu 


He 


Arg 


Ala 


He 
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Glu Pro 


Leu 


Ser 


Gly 


Pro 


Leu Asp 


Gly Gly 
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Gly Arg 
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Gly 




Arg 
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Val 
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Gly 


Val 


Trp 
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He Gly 


Gly 


Val 
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Pro 


Asp 


Arg 
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Val 


Ser 
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Val 


Val 


Thr 


Val 


Asn 
965 


Ala 




Lys 


Glu 


Giy 
970 


Lys 


Ser Arg Asp 


Arg 
975 


Phe 


Ser 


Tyr Val 


Leu 


Pro 




Val 


His 


Ser 


Leu 




Pro Thr Met 


Gly 


Pro 








980 










985 






990 






Lys 


Ala 


Gly 


Gly 


Thr 


Arg 


He 


Thr 


He 


His 


Gly 


Asn Asp Leu 




V 1 
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1000 
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Gly 


Ser 


Glu 


Leu 


Gin 


Val 




Val 






Thr 


Asp Pro Cys 


Thr 


Glu 




1010 








1015 








1020 






Leu 


Met 




Thr 


Asp 


Thr 


Ser 


He 


Ala 


Cys 


Thr 


Met Pro Glu 


Gly 


Ala 


1025 








1030 








1035 




1040 


Leu 


Pro 


Ala 


Pro 


Val 


Pro 


Val 


Cys 


Val 


Arg 


Phe 


Glu Arg Arg 


Gly 


Cys 










1045 






1050 
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Val 


His 


Gly 


Asn 


Leu 


Thr 


Phe Trp 


Tyr 


Met 


Gin 


Asn Pro Val 


He 


Thr 






1060 








1065 




1070 




Ala 


He 


Ser 


Pro 


Arg 


Arg Ser 


Pro 


Val 


Ser 


Gly Gly Arg Thr 


He 


Thr 



1075 1080 1085 



Val Ala Gly Glu Arc Phe His Met Val Gin Asn Val Ser Met Ala Val 

1090 1095 1100 

His His He Gly Arg Glu Pro Thr Leu Cys Lys Val Leu Asn Ser Thr 
1105 1110 1115 1120 

Leu He Thr Cys Pro Ser Pro Gly Ala Leu Ser Asn Ala Ser Ala Pro 

1125 1130 1135 

Val Asp Phe Phe He Asn Gly Arg Ala Tyr Ala Asp Glu Val Ala Val 

1140 1145 1150 

Ala Glu Glu Leu Leu Asp Pro Glu Glu Ala Gin Arg Gly Ser Arg Phe 

1155 1160 1165 

Arg Leu Asd Tyr Leu Pro Asn Pro Gin Phe Ser Thr Ala Lys Arg Glu 

1170 1175 1180 

Lys Trp He Lys His His Pro Gly Glu Pro Leu Thr Leu Val He His 
1185 1190 1195 1200 

Lys Glu Gin Asp Ser Leu Gly Leu Gin Ser His Glu Tyr Arg Val Lys 

1205 1210 1215 

He Gly Gin Val Ser Cys Asp He Gin He Val Ser Asp Arg He He 

1220 1225 1230 

His Cys Ser Val Asn Glu Ser Leu Gly Ala Ala Val Gly Gin Leu Pro 

1235 1240 1245 

He Thr He Gin Val Gly Asn Phe Asn Gin Thr He Ala Thr Leu Gin 

1250 1255 1260 

Leu Gly Gly Ser Glu Thr Ala He He Val Ser He Val He Cys Ser 
1265 1270 1275 1280 

Val Leu Leu Leu Leu Ser Val Val Ala Leu Phe Val Phe Cys Thr Lys 

1285 1290 1295 

Ser Arg Arg Ala Glu Arg Tyr Trp Gin Lys Thr Leu Leu Gin Met Glu 

1300 1305 1310 

Glu Met Glu Ser Gin He Arg Glu Glu He Arg Lys Gly Phe Ala Glu 

1315 1320 1325 

Leu Gin Thr Asp Met Thr Asp Leu Thr Lys Glu Leu Asn Arg Ser Gin 

1330 1335 1340 

Gly He Pro Phe Leu Glu Tyr Lys His Phe Val Thr Arg Thr Phe Phe 
1345 1350 1355 1360 

Pro Lys Cys Ser Ser Leu Tyr Glu Glu Arg Tyr Val Leu Pro Ser Gin 

1365 1370 1375 

Thr Leu Asn Ser Gin Gly Ser Ser Gin Ala Gin Glu Thr His Pro Leu 

1380 1385 1390 

Leu Gly Glu Trp Lys He Pro Glu Ser Cys Arg Pro Asn Met Glu Glu 
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1395 1400 1405 

Gly lie Ser Val Phe Ser Ser Leu Leu Asn Asn Lys His Phe Leu He 

1410 1415 1420 

Val Phe Val His Ala Leu Glu Gin Gin Lys Asp Phe Ala Val Arg Asp 
1425 1430 1435 1440 

Arg Cys Ser Leu Ala Ser Leu Leu Thr He Ala Leu His Gly Lys Leu 

1445 1450 1455 

Glu Tyr Tyr Thr Ser He Met Lys Glu Leu Leu Val Asp Leu He Asp 

1460 1465 1470 

Ala Ser Ala Ala Lys Asn Pro Lys Leu Met Leu Arg Arg Thr Glu Ser 

1475 1480 1485 

Val Val Glu Lys Met Leu Thr Asn Trp Met Ser He Cys Met Tyr Ser 

1490 1495 1500 

Cys Leu Arg Glu Thr Val Gly Glu Pro Phe Phe Leu Leu Leu Cys Ala 
1505 1510 1515 1520 

He Lys Gin Gin He Asn Lys Gly Ser He Asp Ala He Thr Gly Lys 

1525 " 1530 1535 

Ala Arg Tyr Thr Leu Asn Glu Glu Trp Leu Leu Arg Glu Asn He Glu 

1540 1545 1550 

Ala Lys Pro Arg Asn Leu Asn Val Ser Phe Gin Gly Cys Gly Met Asp 

1555 1560 1565 

Ser Leu Ser Val Arg Ala Met Asp Thr Asp Thr Leu Thr Gin Val Lys 

1570 1575 1580 

Glu Lys He Leu Glu Ala Phe Cys Lys Asn Val Pro Tyr Ser Gin Trp 
1585 1590 1595 1600 

Pro Arg Ala Glu Asp Val Asp Leu Glu Trp Phe Ala Ser Ser Thr Gin 

1605 1610 1615 

Ser Tyr He Leu Arg Asp Leu Asp Asp Thr Ser Val Val Glu Asp Gly 

1620 1625 1630 

Arg Lys Lys Leu Asn Thr Leu Ala His Tyr Lys lie Pro Glu Gly Ala 

1635 1640 1645 

Ser Leu Ala Met Ser Leu lie Asp Lys Lys Asp Asn Thr Leu Gly Arg 

1650 1655 1660 

Val Lys Asp Leu Asp Thr Glu Lys Tyr Phe His Leu Val Leu Pro Thr 
1665 1670 1675 1680 

Asp Glu Leu Ala Glu Pro Lys Lys Ser His Arg Gin Ser His Arg Lys 

1685 1690 1695 

Lys Val Leu Pro Glu He Tyr Leu Thr Arg Leu Leu Ser Thr Lys Gly 

1700 1705 1710 

Thr Leu Gin Lys Phe Leu Asp Asp Leu Phe Lys Ala He Leu Ser He 

1715 1720 1725 

Arg Glu Asp Lys Pro Pro Leu Ala Val Lys Tyr Phe Phe Asp Phe Leu 

1730 1735 1740 

Glu Glu Gin Ala Glu Lys Arg Gly He Ser Asp Pro Asp Thr Leu His 
1745 1750 1755 1760 

He Trp Lys Thr Asn Ser Leu Pro Leu Arg Phe Trp Val Asn He Leu 

1765 1770 1775 

Lys Asn Pro Gin Phe Val Phe Asp He Asp Lys Thr Asp His He Asp 

1780 1785 1790 

Ala Cys Leu Ser Val He Ala Gin Ala Phe He Asp Ala Cys Ser He 

1795 1800 1805 

Ser Asp Leu Gin Leu Gly Lys Asp Ser Pro Thr Asn Lys Leu Leu Tyr 

1810 1815 1820 

Ala Lys Glu He Pro Glu Tyr Arg Lys He Val Gin Arg Tyr Tyr Lys 
1825 1830 1835 1840 
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Gin lie Gin Asp Met Thr Pro Leu Ser Glu Gin Glu Met Asn Ala His 

1845 1850 1855 

Leu Ala Glu Glu Ser Arg Lys Tyr Gin Asn Glu Phe Asn Thr Asn Val 

1860 1865 1870 

Ala Met Ala Glu lie Phe Arg Ser Pro Lys Arg Tyr Arg Pro Gin He 

1875 1880 1885 

Met Ala Ala Leu Glu Ala Asn Pro Thr Ala Arg Arg Thr Gin Leu Gin 

1890 1895 1900 

His Lys Phe Glu Gin Val Val Ala Leu Met Glu Asp Asn He Tyr Glu 
1905 1910 1915 1920 

Cys Tyr Ser Glu Ala 
1925 



<210> 7 

<211> 601 

<212> DNA 

<213> HOMO SAPIEN 



<400> 7 

caccagagtc cctgtggagt cctgtggtca gtatcagagc tgcggcgagt gccttggctc 60 

aggcgacccc cactgtggct ggtgtgtgct gcacaacact tgcacccgga aggagcggtg 12 0 

tgagcggtcc aaggagcccc gcaggtttgc ctcggagatg aagcagtgtg tccggctgac 180 

ggtccatccc aacaatatct ccgtctctca gtacaacgcg ctgctggtcc tggagacgta 240 

caatgtcccg gagctgtcag ctggcgtcaa ctgcaccttt gaggacctgt cagagatgga 300 

tgggctggtc gtgggcaatc agatccagtg ctactcccct gcagccaagg aggtgccccg 360 

gatcatcaca gagaatgggg accaccatgt cgtacagctt cagctcaaat caaaggagac 42 0 

cggcatgacc ttcgccagca ccagctttgt cttctacaat tgcagcgtcc acaattcgtg 480 

cctgtcctgc gtggagagtc cataccgctg ccactggtgt aaataccggc atgtctgcac 540 

ccatgacccc aagacctgct ccttccagga aggccgagtg aagctgcccg aggtaggtcc 600 

c 601 

<210> 8 

<211> 199 

<212> PRT 

<213> HOMO SAPIEN 



<400> 8 

Thr Arg Val Pro Val Glu Ser Cys Gly Gin Tyr Gin Ser Cys Gly Glu 

15 10 15 

Cys Leu Gly Ser Gly Asp Pro His Cys Gly Trp Cys Val Leu His Asn 

20 25 30 

Thr Cys Thr Arg Lys Glu Arg Cys Glu Arg Ser Lys Glu Pro Arg Arg 

35 40 45 

Phe Ala Ser Glu Met Lys Gin Cys Val Arg Leu Thr val His Pro Asn 

50 55 60 

Asn He Ser Val Ser Gin Tyr Asn Ala Leu Leu Val Leu Glu Thr Tyr 
65 70 75 80 

Asn Val Pro Glu Leu Ser Ala Gly Val Asn Cys Thr Phe Glu Asp Leu 

85 90 95 

Ser Glu Met Asp Gly Leu Val Val Gly Asn Gin He Gin Cys Tyr Ser 

100 105 110 

Pro Ala Ala Lys Glu Val Pro Arg He He Thr Glu Asn Gly Asp His 

115 120 125 

His Val Val Gin Leu Gin Leu Lys Ser Lys Glu Thr Gly Met Thr Phe 
130 135 140 
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Ala 


Ser Thr 


Ser 


Phe 


Val 


Phe 


Tyr Asn 


Cys 


Ser 


Val His 


Asn Ser Cys 


145 








150 








155 




160 


Leu 


Ser Cys 


Val 


Glu 


Ser 


Pro 


Tyr Arg 


Cys 


His 


Trp Cys 


Lys Tyr Arg 








165 








170 






175 


His 


Val Cys 


Thr 


Asp 


Pro 


Lys 


Thr Cys 


Ser 


Phe 


Gin Glu 


Gly Arg Val 






180 








185 








190 


Lys 


Leu Pro 


Glu 


Val 


Gly 


Pro 













195 



<210> 9 

<211> 6408 

<212> DNA 

<213> HOMO SAPIEN 

<400> 9 

atgcctgctc tgggcccagc tcttctccag gctctctggg ccgggtgggt cctcaccctc 60 

cagccccttc caccaactgc attcactccc aatggcacgt atctgcagca cctggcaagg 120 

gaccccacct caggcaccct ctacctgggg gctaccaact tcctgttcca gctgagccct 180 

gggctgcagc tggaggccac agtgtccacc ggccctgcgc tagacagcag ggactgcctg 240 

ccacctgtga tgcctgatga gcgcccccag gcccagccta ccaacaaccc gaatcagctg 300 

ctcctggtga gcccaggggc cctggtggta tgcgggagcg tgcaccaggg ggtctgtgaa 3 60 

cagcggcgcc tggggcagct cgagcagctg ctgctgcggc cagagcggcc tggggacaca 420 

caatatgtgg ctgccaatga Ccctgcggtc agcacggtgg ggctggtagc ccagggcttg 480 

gcaggggagc ccctcctgtt tgtggggcga ggatacacca gcaggggrgt ggggggtggc 540 

attccaccca tcacaacccg ggccctgtgg ccgcccgacc cccaagccgc cttctcctat 600 

gaggagacag ccaagctggc agtgggccgc ctctccgagt acagccacca cttcgtgagt 660 

gcctttgcac gtggggccag cgcctacttc ctgttcctgc ggcgggacct gcaggctcag 720 

tctagagctt ttcgtgccta tgtatctcga gtgtgtctcc gggaccagca ctactactcc 780 

tatgtggagt tgcctctggc ctgcgaaggt ggccgcnacg ggctgatcca ggctgcagct 840 

gtggccacgt ccagggaggt ggcgcatggg gaggcgctct ttgcagcttt ctcctcggct 900 

gcacccccca ctgtgggccg gcccccatcg gcggctgctg gggcatctgg agcctctgcc 960 

ctctgtgcct tccccctgga tgaggtggac cggcttgcta atcgcacgcg agatgcctgc 1020 

tacacccggg agggtcgtgc tgaggatggg accgaggtgg cctacatcga gtatgatgtc 1080 

aattctgact gtgcacagct gccagtggac accccggatg cttatccctg tggctcagac 1140 

oacacgccca gccccatggc cagccgggtc ccgctggaag ccacaccaat tctggagtgg 1200 

ccagggattc agctaacagc tgtggcagtc accatggaag atggacacac catcgctttc 1260 

ctgggtgata gtcaagggca gctgcacagg gtctacttgg gcccagggag cgatggccac 13 20 

ccatactcca cacagagcat ccagcagggg tctgcagtga gcagagacct cacctttgat 1380 

gggacctttg agcacctgta tgtcatgacc cagagcacac ttctgaaggt tcctgtggct 144 0 

tcctgtgctc agcacctgga ctgtgcatct tgccttgctc acagggaccc atactgtggg 1500 

tggtgcgtgc tccttggcag gtgcagtcgc cgttctgagt gctcgagggg ccagggccca 1560 

gagcagtggc tatggagctt ccagcctgag ctgggctgtc Cgcaagtggc agccatgagt 1620 

cctgccaaca tcagccgaga ggagacgagg gaggttttcc tatcagtgcc agacctgcca 1680 

cccctgtggc caggggagtc atattcctgc cactttgggg aacatcagag tcctgccctg 1740 

ctgactggtt ctggtgtgat gtgcccctcc ccagacccta gtgaggcccc agtgctgccg 1800 

agaggagccg actacgtatc cgtgagcgtg gagcCcagat ttggcgctgt tgtgatcgcc 1860 

aaaacttccc tctctttcta tgactgtgtg gcggccactg aactccgccc atctgcgcag 1920 

tgccaggcct gtgtgagcag ccgctggggg tgtaactggt gtgtctggca gcacctgtgc 1980 

acccacaagg cctcgtgtga tgctgggccc atggttgcaa gccatcagag cccgcttgtc 2040 

tccccagacc ctcctgcaag aggtggaccc agcccctccc cacccacagc ccccaaagcc 2100 

ctggccaccc ctgctcctga cacccttccc gtggagcctg gggctccctc cacagccaca 2160 

gcttcggaca tctcacctgg ggctagtcct tccctgctca gcccctgggg gccatgggca 2220 

ggttctggct ccatatcttc ccctggctcc acagggtcgc ctctccatga ggagccctcc 2280 

cctcccagcc cccaaaatgg acctggaacc gctgtccctg cccccactga cttcagaccc 2340 

tcagccacac ctgaggacct cttggcctcc ccgctgtcac cgtcagaggt agcagcagtg 2400 
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ccccctgcag accctggccc cgaggctctt catcccacag tgcccctgga cctgccccct 2460 

gccactgttc ctgccaccac tttcccaggg gccatgggct ccgtgaagcc cgccctggac 252 0 

tggctcacga gagaaggcgg cgagctgccc gaggcggacg agtggacggg gggtgacgca 2580 

cccgccttct ccacttccac cctcctctca ggtgatggag actcagcaga gcttgagggc 2640 

cctcccgccc ccctcatcct cccgtccagc ctcgactacc agtatgacac ccccgggctc 2700 

tgggagctgg aagaggcgac cttgggggca agctcccgcc cctgtgtgga gagcgttcag 2760 

ggctccacgt tgatgccggt ccatgtggag cgggaaatcc ggctgctagg caggaacctg 2820 

caccttttcc aggatggccc aggagacaat gagtgtgtga tggagctgga gggcctcgag 2880 

gtggtggttg aggcccgggt cgagtgtgag ccacctccag atacccagtg ccatgtcacc 2940 

tgccagcagc accagctcag ctatgaggct ctgcagccgg agctccgtgt ggggctgttt 3 000 

ctgcgtcggg ccggccgtct gcgtgtggac agtgctgagg ggctgcatgt ggtactgtat 3060 

gactgttccg tgggacatgg agactgcagc cgctgccaaa ctgccatgcc ccagtatggc 312 0 

tgcgtgtggt gtgaggggga gcgtccacgt tgtgtgaccc gggaggcctg tggtgaggct 318 0 

gaggctgtgg ccacccagtg cccagcgccc ctcatccact cggtggagcc actgactggg 3240 

cctgtagacg gaggcacccg tgtcaccatc aggggctcca acctgggcca gcatgtgcag 3300 

gatgtgctgg gcatggtcac ggtggctgga gtgccctgtg ctgtggatgc ccaggagtac 3360 

gaggtctcca gcagcctcgt gtgcatcacc ggggccagtg gggaggaggt ggccggcgcc 3420 

acagcggtgg aggtgccggg aagaggacgt ggtgtcccag aacacgactt tgcctaccag 3480 

gatccgaagg tccattccat cttcccggcc cgcggcccca gagctggggg cacccgtctc 3540 

accctgaatg gctccaagct cctgactggg cggctggagg acatccgagt ggtggttgga 3600 

gaccagcctt gtcacttgct gccggagcag cagtcagaac aactgcggtg tgagaccagc 3660 

ccacgcccca cgcctgccac gctccctgtg gctgtgtggt ttggggccac ggagcggagg 3720 

cttcaacgcg gacagttcaa gtataccttg gaccccaaca tcacctctgc tggccccacc 3780 

aagagcttcc tcagtggagg acgtgagata tgcgtccgtg gccagaatct ggacgtggta 3840 

cagacgccaa gaatccgggt gaccgtggtc tcgagaatgc tgcagcccaa ccaggggctt 3900 

ggacggaggc gtcgcgtggt cccggagacg gcatgttccc ttggaccctc ctgcagtagc 3960 

caccaatttg aggagccgtg ccatgtcaac tcctcccagc tcatcacgtg ccgcacacct 402 0 

gccctcccag gcctgcctga ggacccctgg gtccgggtgg aatttatcct tgacaacctg 4080 

gtccttgact ttgcaacacc gaaccccaca cctttctcct atgaggccga ccccaccctg 4140 

cagccactca accctgagga ccccaccatg ccattccggc acaagcctgg gagtgtgttc 4200 

tccgtggagg gggagaacct ggaccttgca atgtccaagg aggaggtggt ggctatgata 4260 

ggggatggcc cctgcgtggc gaagacgctg acgcggcacc acctgtactg cgagcccccc 432 0 

gtggagcagc ccctgccacg gcaccatgcc ctccgagagg cacctgactc tttgcctgag 43 8 0 

ttcacggtgc agatggggaa cttgcgcttc tccctgggtc acgtgcagta tgacggcgag 4440 

agccctgggg cttttcctgt ggcagcccag gtgggcttgg gggtgggcac ctctcttctg 4500 

gctctgggtg tcatcatcat tgtcctcatg tacaggagga agagcaagca ggccctgagg 4560 

gactataaga aggttcagat ccagctggag aatctggaga gcagtgtgcg ggaccgctgc 462 0 

aagaaggaat tcacagacct catgactgag atgaccgatc tcaccagtga cctcctgggc 4680 

agcggcatcc ccttcctcga ctacaaggtg tatgcggaga ggatcttctt ccctgggcac 474 0 

cgcgagtcgc ccttgcaccg ggacctgggt gtgcctgaga gcagacggcc cactgtagag 4800 

caagggctgg ggcagctctc taacctgctc aacagcaagc tcttcctcac caagttcatc 4860 

cacacgctgg agacccagcg caccttttca gctcgggacc gtgcctacgt ggcatctctg 4920 

ctcaccgtgg cactgcatgg gaagcttgag tatttcactg acatcctccg cactctgctc 4980 

agtgacctgg ttgcccagta tgtggccaag aaccccaagc tgatgctgcg caggacagag 5040 

accgtggtgg agaagctgct caccaactgg atgtccatct gtctgtatac cttcgtgagg 5100 

gaccccgtag gggagcctct gtacatgctc tttcgaggga ttaagcacca agtggataag 5160 

gggccagtgg acagtgtgac aggcaaggcc aaatacacct tgaacgacaa ccgcctgctc 522 0 

agagaggatg tggagtaccg tcccctgacc ttgaatgcac tattggctgt ggggcctggg 5280 

gcaggagagg cccagggcgt gcccgtgaag gtcctagact gtgacaccat ctcccaggca 534 0 

aaggagaaga tgctggacca gctttataaa ggagtgcctc tcacccagcg gccagaccct 5400 

cgcacccttg atgttgagtg gcggtctggg gtggccgggc acctcattct ttctgacgag 5460 

gacgtcactt ctgaggtcca gggtctgtgg aggcgcctga acacactgca gcattacaag 5520 

gtcccagatg gagcaactgt ggccctcgtc ccctgcctca ccaagcatgt gctccgggaa 5580 

aaccaggatt atgtccctgg agagcggacc ccaatgctgg aggatgtaga tgaggggggc 5 640 

atccggccct ggcacctggt gaagccaagt gatgagccgg agccgcccag gcctcggagg 5700 
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ggcagccttc ggggcgggga gcgtgagcgc gccaaggcca tccctgagat ctacctgacc 5760 

cgcctgctgt ccatgaaggg caccctgcag aagttcgtgg atgacctgtt ccaggtgatt 5820 

ctcagcacca gccgccccgt gccgctcgct gtgaagtact tctttgacct gctggatgag 5880 

caggcccagc agcatggcat ctccgaccag gacaccatcc acatctggaa gaccaacagc 5940 

ttgcctctga ggttctggat caatataata aaaaacccgc agtttgtgtt cgacgtgcaa 6000 

acatctgata acatggatgc ggtgctcctt gtcattgcac agaccttcat ggacgcctgc 6060 

accctggccg accacaagct gggccgggac tccccgatca acaaacttct gtatgcacgg 6120 

gacattcccc ggtacaagcg gatggtggaa aggtactatg cagacatcag acagactgtc 6180 

ccagccagcg accaagagat gaactctgtc ctggctgaac tgtcctggaa ccactccgga 6240 

gacctcgggg cgcgagtggc cctgcatgaa ctctacaagt acatcaacaa gtactatgac 6300 

cagatcatca ctgccctgga ggaggatggc acggcccaga agatgcagct gggctatcgg 6360 

ctccagcaga ttgcagctgc tgtggaaaac aaggtcacag atctatag 6408 



<210> 10 

<211> 2135 

<212> PRT 

<213> HOMO SAPIEN 

<400> 10 



Met 


Pro 


Ala 


Leu 


Gly 


Pro 


Ala 


Leu 


1 








5 








Val 


Leu 


Thr 


Leu 


Gin 


Pro 


Leu 


Pro 








20 










Thr 


Tyr 


Leu 


Gin 


His 


Leu 


Ala 


Arg 


















Leu 


Gly 


Ala 


Thr 


Asn 


Phe 


Leu 


Phe 




50 










55 




Glu 


Ala 


Thr 


Val 


Ser 


Thr 


Gly 


Pro 


65 










70 




Pro 


Pro 


Val 


Met 


Pro 
85 


Asp 


Glu 


Cys 


Pro 


Asn 


Gin 


Leu 


Leu 


Leu 


Val 


Ser 








100 










Ser 


Val 


His 


Gin 


Gly 


Val 


Cys 


Glu 






115 






120 


Gin 


Leu 


Leu 


Leu 


Arg 


Pro 


Glu 


Arg 




130 










135 




Ala 


Asn 


Asp 


Pro 


Ala 


Val 


Ser 


Thr 


145 










150 






Ala 


Gly 


Glu 


Pro 


Leu 


Leu 


Phe 


Val 










165 








Val 


Gly 


Gly 


Gly 


He 


Pro 


Pro 


He 








180 










Asp 


Pro 


Gin 


Ala 


Ala 


Phe 


Ser 


Tyr 






195 










200 


Gly 


Arg 


Leu 


Ser 


Glu 


Tyr 


Ser 


His 




210 










215 




Gly 


Ala 


Ser 


Ala 


Tyr 


Phe 


Leu 


Phe 


225 










230 






Ser 


Arg 


Ala 


Phe 


Arg 


Ala 


Tyr 


Val 










245 








His 


Tyr 


Tyr 


Ser 


Tyr 


Val 


Glu 


Leu 



260 



Leu 


Gin 


Ala 


Leu 


Trp 


Ala Gly Trp 




10 








15 


Pro 


Thr 


Ala 


Phe 


Thr 


Pro Asn Gly 


25 










30 


Asp 


Pro 


Tnr 


Ser 


Gly 
45 


Thr Leu Tyr 


Gl 










Leu Gin Leu 








60 






Val 


Leu 


Asp 
75 


Ser 


Arg 


Asp Cys Leu 
80 


Pro 


Gin 


Ala 


Gin 


Pro 


Thr Asn Asn 




90 








95 


Pro 


Gly 


Ala 


Leu 


Val 


Val Cys Gly 


105 










110 


Gin 


Arg 


Arg 


Leu 


Gly 


Gin Leu Glu 








125 




Pro 


Gly 


Asp 


Thr 


Gin 


Tyr Val Ala 








140 






Val 


Gly 


Leu 


Val 


Ala 


Gin Gly Leu 






155 






160 


Gly 


Arg 


Gly 


Tyr 


Thr 


Ser Arg Gly 




170 








175 


Thr 


Thr 


Arg 


Ala 


Leu 


Trp Pro Pro 


185 










190 


Glu 


Glu 


Thr 


Ala 


Lys 


Leu Ala Val 










205 




His 


Phe 


Val 


Ser 


Ala 


Phe Ala Arg 








220 






Leu 


Arg 


Arg 


Asp 


Leu 


Gin Ala Gin 






235 






240 


Ser 


Arg 


Val 


Cys 


Leu 


Arg Asp Gin 




250 








255 


Pro 


Leu 


Ala 


Cys 


Glu 


Gly Gly Arg 


265 










270 
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Tyr Gly Leu 


He 


Gin 


Ala 


Ala 


Ala 


Val 


Ala 


Thr 


Ser 


Arg 


Glu 


Val 


Ala 


275 










280 










285 








His Gly Glu Val 


Leu 


Phe 


Ala 


Ala 


Phe 


Ser 


Ser 


Ala 


Ala 


Pro 


Pro 


Thr 










295 










300 










Val Gly Arg 


Pro 


Pro 


Ser 


Ala 


Ala 


Ala 


Gly 


Ala 


Ser 


Gly 


Ala 


Ser 


Ala 








310 










315 










320 


L^ C Al 


Ph 

e 


Pro 


Leu 


Asp 


Glu 


Val 


Asp 


Arg 


Leu 


Ala 


Asn 


Arg 


Thr 






325 










330 










335 




Ar A Al 
g sp a 




Tyr 


Thr 


Arg 


Glu 


Gly 


Arg 


Ala 


Glu 


Asp 


Gly 


Thr 


Glu 




340 










345 










350 






Val Ala Tyir 


He 


Glu 


Tyr 


Asp 


Val 




Ser 


Asp 


Cys 


Ala 


Gin 


Leu 


Pro 












360 










365 








Val Asp Thr 


Leu 


Asp 


Ala 


Tyr 


Pro 


Cys 


Gly 


Ser 


Asp 


His 


Thr 


Pro 


Ser 


370 








375 










380 










Pro Met Ala 


Ser 


Arg 


Val 


Pro 


Leu 


Glu 


Ala 


Thr 


Pro 


He 


Leu 


Glu 


Trp 


385 






390 










395 










400 


Pro Gly lie 


Gin 




Thr 


Ala 


Val 


Ala 


Val 


Thr 


Met 


Glu 


Asp 


Gly 


His 






405 










410 










415 




Thr He Ala 


Phe 


Leu 


Gly 


Asp 


Ser 


Gin 


Gly 


Gin 


Leu 


His 


Arg 


Val 


Tyr 




420 










425 










430 






Leu Gly Pro 


Gly 


Ser 


Asp Gly 


His 


Pro 


Tyr 


Ser 


Thr 


Gin 


Ser 


He 


Gin 


435 










440 










445 








Gin Gly Ser Ala Val 


Ser 




Asp 


Leu 


Thr 


Phe 


Asp 


Gly 


Thr 


Phe 


Glu 


450 








455 










460 










His Leu Tyr Val 


Met 


Thr 


Gin 


Ser 


Thr 


Leu 


Leu 


Lys 


Val 


Pro 


Val 


Ala 


465 






470 










475 










480 


Ser Cys Ala 


Gin 


His 


Leu 


Asp 


Cys 


Ala 


Ser 


Cys 


Leu 


Ala 


His 


Arg 


Asp 






485 










490 










495 




Pro Tyr Cys 


Gly Trp Cys Val 


Leu 


Leu 


Gly 


Arg 


Cys 


Ser 


Arg 


Arg 


Ser 




500 










505 










510 






Glu Cys Ser Arg 


Gly Gin Gly 


Pro 


Glu 


Gin 


Trp 


Leu 


Trp 


Ser 


Phe 


Gin 


515 










520 










525 








Pro Glu Leu Gly Cys 


Leu 


Gin 


Val 


Ala 


Ala 


Met 


Ser 


Pro 


Ala 


Asn 


He 


530 








535 










540 










Ser Arg Glu 


Glu 


Thr Arg 


Glu 


Val 


Phe 


Leu 


Ser 


Val 


Pro 


Asp 


Leu 


Pro 


545 






550 










555 










560 


Pro Leu Trp 


Pro Gly 


Glu 


Ser 


Tyr 


Ser 


Cys 


His 


Phe 


Gly 


Glu 


His 


Gin 






565 










570 










575 




Ser Pro Ala 


Leu 


Leu Thr Gly 


Ser 


Gly 


Val 


Met 


Cys 


Pro 


Ser 


Pro 


Asp 




580 










585 










590 






Pro Ser Glu 


Ala 


Pro 


Val 


Leu 


Pro 


Arg 


Gly 


Ala 


Asp 


Tyr 


Val 


Ser 


Val 


595 










600 










605 








Ser Val Glu 


Leu 


Arg 


Phe Gly 


Ala 


Val 


Val 


He 


Ala 


Lys 


Thr 


Ser 


Leu 


610 






615 










620 










Ser Phe Tyr Asp 


Cys 


Val 


Ala 


Val 


Thr 


Glu 


Leu 


Arg 


Pro 


Ser 


Ala 


Gin 


625 






630 










635 










640 


Cys Gin Ala Cys 


Val 


Ser 


Ser 


Arg 


Trp 


Gly 


Cys 


Asn 


Trp 


Cys 


val 


Trp 






645 










650 










655 




Gin His Leu 


Cys 


Thr 


His 




Ala 


Ser 


Cys 


Asp 


Ala 


Gly 


Pro 


Met 


Val 




660 










665 










670 






Ala Ser His 


Gin 


Ser 


Pro 




Val 


Ser 


Pro 


Asp 


Pro 


Pro 


Ala 


Arg 


Gly 


675 










680 










685 








Gly Pro Ser 


Pro 


Ser 


Pro 


Pro 


Thr 


Ala 


Pro 


Lys 


Ala 


Leu 


Ala 


Thr 


Pro 


690 








695 










700 










Ala Pro Asp 


Thr 


Leu 


Pro 


Val 


Glu 


Pro 


Gly 


Ala 


Pro 


Ser 


Thr 


Ala 


Thr 
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705 








710 








715 










720 


Ala Ser 


Asp 


lie 


Ser 


Pro Gly 


Ala 


Ser 


Pro 


Ser 


Leu 


Leu 


Ser 


Pro 


Trp 








725 








730 










735 




Gly Pro 


Trp 


Ala Gly 


Ser Gly 


Ser 


He 


Ser 


Ser 


Pro 


Gly 


Ser 


Thr 


Gly 














745 










750 






Ser Pro 


Leu 


is 


n 

u 


Glu Pro 


Ser 


Pro 


Pro 


Ser 


Pro 


Gin 




Gly 


Pro 




755 








760 










765 








Gly Thr 


Ala 


, 

a 


ro 


Ala Pro 


Thr 


Asp 


Phe 


Arg 


Pro 


Ser 


Ala 


Thr 


Pro 


770 








775 










780 










Glu Asp 


Leu 


eu 


a 


Ser Pro 


Leu 


Ser 


Pro 


Ser 


Glu 


Val 


Ala 


Ala 


Val 


785 








790 








795 










800 




Ala 


sp 




Gly 


Glu 


Ala 


Leu 


His 


Pro 


Thr 


Val 


Pre 








80S 






810 










815 




Asp Leu 


Pro 






Thr Val 


Pro 


Ala 


Thr 


Thr 


Phe 


Pro 


Gly 


Ala 


Met 














825 










830 






Gly Ser 


Val 


ys 


ro 


Ala Leu 


Asp 


Trp 


Leu 


Thr 


Arg 


Glu 


Gly 


Gly 


Glu 




835 








840 










845 








Leu Pro 


Glu 


, 


ASP 


Glu Trp 


Thr 


Gly 


Gly 


Asp 


Ala 


Pro 


Ala 


Phe 


Ser 


850 








855 










860 












Thr 


Leu 


Leu 




Asp 


Gly 


Asp 


Ser 


Ala 


Glu 


Leu 


Glu 


Gly 


865 








870 Y 








875 










880 




Ala 


Pro 


Leu 


He Leu 


Pro 


Ser 


Ser 




Asp 


Tyr 


Gin 


Tyr 


Asp 








885 








890 










895 




Thr Pro 


Gly 


Leu 


Trp 


Glu Leu 


Glu 


Glu 


Ala 


Thr 


Leu 


Gly 


Ala 


Ser 


Ser 






900 








905 










910 






Cys Pro 


Cys 


Val 


Glu 


Ser Val 


Gin 


Gly 


Ser 


Thr 


Lou 


Met 


Pro 


Val 


His 




915 








920 










925 








Val Glu 




Glu 


He 


Arg Leu 




Gly 


Arg 


Asn 


Leu 


His 


Leu 


Phe 


Gin 


930 








935 










940 










Asp Gly 


Pro 


Gly Asp 


Asn Glu 


Cys 


Val 


Met 


Glu 


Leu 


Glu 


Gly 


Leu 


Glu 


945 








950 








955 










960 


Val Val 


Val 


Glu 


Ala 


Arg Val 


Glu 


Cys 


Glu 


Pro 


Pro 


Pro 


Asp 


Thr 


Gin 








965 






970 










975 




Cys His 


Val 


Thr 


Cys 


Gin Gin 


His 


Gin 


Leu 


Ser 


Tyr 


Glu 


Ala 


Leu 


Gin 






980 








985 










990 






Pro Glu 


Leu 


Arg Val 


Gly Leu 


Phe 


Leu 


Arg 


Arg 


Ala 


Gly 


Arg 


Leu 


Arg 




995 








1000 








1005 






Val Asp 


Ser 


Ala 


Glu 


Gly Leu 


His 


Val 


Val 




Tyr 


Asp 


Cys 


Ser 


Val 


1010 






1015 








1020 








Gly His 


Gly 


Asp Cys 


Ser Arg Cys 


Gin 


Thr 


Ala 


Met 


Pro 


Gin 


Tyr 


Gly 


1025 








1030 








1035 








1040 


Cys Val 


Trp 


Cys 


Glu 


Gly Glu Arg 


Pro 


Arg 


Cys Val 


Thr Arg 


Glu 


Ala 



1045 1050 1055 



Cys Gly Glu Ala Glu Ala Val Ala Thr Gin Cys Pro Ala Pro Leu He 

1060 1065 1070 

His Ser Val Glu Pro Leu Thr Gly Pro Val Asp Gly Gly Thr Arg Val 

1075 1080 1085 

Thr He Arg Gly Ser Asn Leu Gly Gin His Val Gin Asp Val Leu Gly 

1090 1095 1100 

Met Val Thr Val Ala Gly Val Pro Cys Ala Val Asp Ala Gin Glu Tyr 
1105 1110 1115 1120 

Glu Val Ser Ser Ser Leu Val Cys He Thr Gly Ala Ser Gly Glu Glu 

1125 1130 1135 

Val Ala Gly Ala Thr Ala Val Glu Val Pro Gly Arg Gly Arg Gly Val 
1140 1145 1150 
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Ser Glu His Asp Phe Ala Tyr Gin Asp Pro Lys Val His Ser lie Phe 

1155 1160 1165 

Pro Ala Arg Gly Pro Arg Ala Gly Gly Thr Arg Leu Thr Leu Asn Gly 

1170 1175 1180 

Ser Lys Leu Leu Thr Gly Arg Leu Glu Asp lie Arg Val Val Val Gly 
1185 1190 1195 1200 

Asp Gin Pro Cys His Leu Leu Pro Glu Gin Gin Ser Glu Gin Leu Arg 

1205 1210 1215 

Cys Glu Thr Ser Pro Arg Pro Thr Pro Ala Thr Leu Pro Val Ala Val 

1220 1225 1230 

Trp Phe Gly Ala Thr Glu Arg Arg Leu Gin Arg Gly Gin Phe Lys Tyr 

1235 1240 1245 

Thr Leu Asp Pro Asn lie Thr Ser Ala Gly Pro Thr Lys Ser Phe Leu 

1250 1255 1260 

Ser Gly Gly Arg Glu He Cys Val Arg Gly Gin Asn Leu Asp Val Val 
1265 1270 1275 1280 

Gin Thr Pro Arg He Arg Val Thr Val Val Ser Arg Met Leu Gin Pro 

1285 1290 1295 

Ser Gin Gly Leu Gly Arg Arg Arg Arg Val Val Pro Glu Thr Ala Cys 

1300 1305 1310 

Ser Leu Gly Pro Ser Cys Ser Ser Gin Gin Phe Glu Glu Pro Cys His 

1315 1320 1325 

Val Asn Ser Ser Gin Leu He Thr Cys Arg Thr Pro Ala Leu Pro Gly 

1330 1335 1340 

Leu Pro Glu Asp Pro Trp Val Arg Val Glu Phe He Leu Asp Asn Leu 
1345 1350 1355 1360 

Val Phe Asp Phe Ala Thr Leu Asn Pro Thr Pro Phe Ser Tyr Glu Ala 

1365 1370 1375 

Asp Pro Thr Leu Gin Pro Leu Asn Pro Glu Asp Pro Thr Met Pro Phe 

1380 1385 1390 

Arg His Lys Pro Gly Ser Val Phe Ser Val Glu Gly Glu Asn Leu Asp 

1395 1400 1405 

Leu Ala Met Ser Lys Glu Glu Val Val Ala Met He Gly Asp Gly Pro 

1410 1415 1420 

Cys Val Val Lys Thr Leu Thr Arg His His Leu Tyr Cys Glu Pro Pro 
1425 1430 1435 1440 

Val Glu Gin Pro Leu Pro Arg His His Ala Leu Arg Glu Ala Pro Asp 

1445 1450 1455 

Ser Leu Pro Glu Phe Thr Val Gin Met Gly Asn Leu Arg Phe Ser Leu 

1460 1465 1470 

Gly His Val Gin Tyr Asp Gly Glu Ser Pro Gly Ala Phe Pro Val Ala 

1475 1480 1485 

Ala Gin Val Gly Leu Gly Val Gly Thr Ser Leu Leu Ala Leu Gly Val 

1490 1495 1500 

He He He Val Leu Met Tyr Arg Arg Lys Ser Lys Gin Ala Leu Arg 
1505 1510 1515 1520 

Asp Tyr Lys Lys Val Gin He Gin Leu Glu Asn Leu Glu Ser Ser Val 

1525 1530 1535 

Arg Asp Arg Cys Lys Lys Glu Phe Thr Asp Leu Met Thr Glu Met Thr 

1540 1545 1550 

Asp Leu Thr Ser Asp Leu Leu Gly Ser Gly He Pro Phe Leu Asp Tyr 

1555 1560 1565 

Lys Val Tyr Ala Glu Arg He Phe Phe Pro Gly His Arg Glu Ser Pro 

1570 1575 1580 

Leu His Arg Asp Leu Gly Val Pro Glu Ser Arg Arg Pro Thr Val Glu 
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1585 1590 1595 1600 

Gin Gly Leu Gly Gin Leu Ser Asn Leu Leu Asn Ser Lys Leu Phe Leu 

1605 1610 1615 

Thr Lys Phe He His Thr Leu Glu Ser Gin Arg Thr Phe Ser Ala Arg 

1620 1625 1630 

Asp Arg Ala Tyr Val Ala Ser Leu Leu Thr Val Ala Leu His Gly Lys 

1635 1640 1645 

Leu Glu Tyr Phe Thr Asp He Leu Arg Thr Leu Leu Ser Asp Leu Val 

1650 1655 1660 

Ala Gin Tyr Val Ala Lys Asn Pro Lys Leu Met Leu Arg Arg Thr Glu 
1665 1670 1675 1680 

Thr Val Val Glu Lys Leu Leu Thr Asn Trp Met Ser He Cys Leu Tyr 

1685 1690 1695 

Thr Phe Val Arg Asp Ser Val Gly Glu Pro Leu Tyr Met Leu Phe Arg 

1700 1705 1710 

Gly He Lys His Gin Val Asp Lys Gly Pro Val Asp Ser Val Thr Gly 

1715 1720 1725 

Lys Ala Lys Tyr Thr Leu Asn Asp Asn Arg Leu Leu Arg Glu Asp Val 

1730 1735 1740 

Glu Tyr Arg Pro Leu Thr Leu Asn Ala Leu Leu Ala Val Gly Pro Gly 
1745 1750 1755 " 1760 

Ala Gly Glu Ala Gin Gly Val Pro Val Lys Val Leu Asp Cys Asp Thr 

1765 1770 1775 

He Ser Gin Ala Lys Glu Lys Met Leu Asp Gin Leu Tyr Lys Gly Val 

1780 1785 1790 

Pro Leu Thr Gin Arg Pro Asp Pro Arg Thr Leu Asp Val Glu Trp Arg 

1795 1800 1805 

Ser Gly Val Ala Gly His Leu He Leu Ser Asp Glu Asp Val Thr Ser 

1810 1815 1820 

Glu Val Gin Gly Leu Trp Arg Arg Leu Asn Thr Leu Gin His Tyr Lys 
1825 1830 1835 1840 

Val Pro Asp Gly Ala Thr Val Ala Leu Val Pro Cys Leu Thr Lys His 

1845 1850 1855 

Val Leu Arg Glu Asn Gin Asp Tyr Val Pro Gly Glu Arg Thr Pro Met 

1860 1865 1870 

Leu Glu Asp Val Asp Glu Gly Gly He Arg Pro Trp His Leu Val Lys 

1875 1880 1885 

Pro Ser Asp Glu Pro Glu Pro Pro Arg Pro Arg Arg Gly Ser Leu Arg 

1890 1895 1900 

Gly Gly Glu Arg Glu Arg Ala Lys Ala He Pro Glu He Tyr Leu Thr 
1905 1910 1915 1920 

Arg Leu Leu Ser Met Lys Gly Thr Leu Gin Lys Phe Val Asp Asp Leu 

1925 1930 1935 

Phe Gin Val He Leu Ser Thr Ser Arg Pro Val Pro Leu Ala Val Lys 

1940 1945 1950 

Tyr Phe Phe Asp Leu Leu Asp Glu Gin Ala Gin Gin His Gly He Ser 

1955 1960 1965 

Asp Gin Asp Thr He His He Trp Lys Thr Asn Ser Leu Pro Leu Arg 

1970 1975 1980 

Phe Trp He Asn He He Lys Asn Pro Gin Phe Val Phe Asp Val Gin 
1985 1990 1995 2000 

Thr Ser Asp Asn Met Asp Ala Val Leu Leu Val He Ala Gin Thr Phe 

2005 2010 2015 

Met Asp Ala Cys Thr Leu Ala Asp His Lys Leu Gly Arg Asp Ser Pro 
2020 2025 2030 
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He Asn Lys Leu Leu Tyr Ala Arg Asp He Pro Arg Tyr Lys Arg Met 

2035 2040 2045 

Val Glu Arg Tyr Tyr Ala Asp He Arg Gin Thr Val Pro Ala Ser Asp 

2050 2055 2060 

Gin Glu Met Asn Ser Val Leu Ala Glu Leu Ser Trp Asn Tyr Ser Gly 
2065 2070 2075 2080 

Asp Leu Gly Ala Arg Val Ala Leu His Glu Leu Tyr Lys Tyr He Asn 

2085 2090 ~ 2095 

Lys Tyr Tyr Asp Gin He He Thr Ala Leu Glu Glu Asp Gly Thr Ala 

2100 2105 2110 

Gin Lys Met Gin Leu Gly Tyr Arg Leu Gin Gin He Ala Ala Ala Val 

2115 2120 2125 

Glu Asn Lys Val Thr Asp Leu 
2130 2135 

<210> 11 

<211> 2190 

<212> DNA 

<213> HOMO SAPIEN 

<400> 11 

atgcctgctc tgggcccagc tcttctccag gctctctggg ccgggtgggt cctcaccctc 60 

cagccccttc caccaactgc attcactccc aatggcacgt atctgcagca cctggcaagg 120 

gaccccacct caggcaccct ctacctgggg gctaccaact tcctgttcca gctgagccct 180 

gggctgcagc tggaggccac agtgtccacc ggccctgtgc tagacagcag ggactgcctg 240 

ccacctgtga tgcctgatga gtgcccccag gcccagccta ccaacaaccc gaatcagctg 300 

ctcctggtga gcccaggggc cctggtggta tgcgggagcg tgcaccaggg ggtctgtgaa 360 

cagcggcgcc tggggcagct cgagcagctg ctgctgcggc cagagcggcc tggggacaca 420 

caatatgtgg ctgccaatga tcctgcggtc agcacggtgg ggctggtagc ccagggcttg 480 

gcaggggagc ccctcctgtt tgtggggcga ggatacacca gcaggggtgt ggggggtggc 540 

attccaccca tcacaacccg ggccctgtgg ccgcccgacc cccaagctgc cttctcctat 600 

gaggagacag ccaagctggc agtgggccgc ctctccgagt acagccacca cttcgtgagt 660 

gcctttgcac gtggggccag cgcctacttc ctgttcctgc ggcgggacct gcaggctcag 72 0 

tctagagctt ttcgtgccta tgtatctcga gtgtgtctcc gggaccagca ctactactcc 780 

tatgtggagt tgcctctggc ctgcgaaggt ggccgctacg ggctgatcca ggctgcagct 840 

gtggccacgt ccagggaggt ggcgcatggg gaggtgctct ttgcagcttt ctcctcggct 900 

gcacccccca ctgtgggccg gcccccatcg gcggctgctg gggcatctgg agcctctgcc 960 

ctctgtgcct tccccctgga tgaggtggac cggcttgcta atcgcacgcg agatgcctgc 1020 

tacacccggg agggtcgtgc tgaggatggg accgaggtgg cctacatcga gtatgatgtc 1080 

aattctgact gtgcacagct gccagtggac accctggatg cttatccctg tggctcagac 1140 

cacacgccca gccccatggc cagccgggtc ccgctggaag ccacaccaat tctggagtgg 1200 

ccagggattc agctaacagc tgtggcagtc accatggaag atggacacac catcgctttc 1260 

ctgggtgata gtcaagggca gctgcacagg gtctacttgg gcccagggag cgatggccac 1320 

ccatactcca cacagagcat ccagcagggg tctgcagtga gcagagacct cacctttgat 1380 

gggacctttg agcacctgta tgtcatgacc cagagcacac ttctgaaggt tcctgtggct 1440 

tcctgtgctc agcacctgga ctgtgcatct tgccttgctc acagggaccc atactgtggg 1500 

tggtgcgtgc tccttggcag gtgcagtcgc cgttctgagt gctcgagggg ccagggccca 1560 

gagcagtggc tatggagctt ccagcctgag ctgggctgtc tgcaagtggc agccatgagt 1620 

cctgccaaca tcagccgaga ggagacgagg gaggttttcc tatcagtgcc agacctgcca 1680 

cccctgtggc caggggagtc atattcctgc cactttgggg aacatcagag tcctgccctg 1740 

ctgactggtt ctggtgtgat gtgcccctcc ccagacccta gtgaggcccc agtgctgccg 1800 

agaggagccg actacgtatc cgtgagcgtg gagctcagat ttggcgctgt tgtgatcgcc 1860 

aaaacttccc tctctttcta tgactgtgtg gcggtcactg aactccgccc atctgcgcag 1920 

tgccaggcct gtgtgagcag ccgctggggg tgtaactggt gtgtctggca gcacctgtgc 1980 

acccacaagg cctcgtgtga tgctgggccc atggttgcaa gccatcaggt gatggagact 2040 
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cagcagagct tgagggccct cccgcccccc tcatcctccc gtccagcctc gactaccagt 2100 
atgacacccc cgggctctgg gagctggaag aggcgacctt gggggcaagc tcctgcccct 2160 
gtgtggagag cgttcagggc tccacgttga 2190 

<210> 12 

<211> 729 

<212> PRT 

<213> HOMO SAPIEN 



<400> 12 



Met Pro 


Ala 


Leu 


Gly 


Pro 


Ala 


Leu 


Leu 


Gin 


Ala 


Leu 


Trp 


Ala 


Gly 


Trp 


1 






5 










10 










15 




Val Leu 


Thr 


Leu 


Gin 


Pro 


Leu 


Pro 


Pro 


Thr 


Ala 


Phe 


Thr 


Pro 


Asn 


Gly 






20 










25 










30 






Thr Tyr 


Leu 


Gin 


His 


Leu 


Ala 


Arg 


Asp 


Pro 


Thr 


Ser 


Gly 


Thr 


" eu 


Tyr 




35 










40 










45 








Leu Gly 


Ala 


Thr 


Asn 


Phe 


Leu 


Phe 


Gin 


Leu 


Ser 


Pro 


Gly 


Leu 


Gin 


Leu 


50 










55 










60 










Glu Ala 


Thr 


Val 


Ser 


Thr 


Gly 


Pro 


Val 


Leu 


Asp 


Ser 


Arg 


Asp 


Cys 


Leu 


65 








70 










75 










80 


Pro Pro 


Val 


Met 


Pro 


Asp 


Glu 


Cys 


Pro 


Gin 


Ala 


Gin 


Pro 


Thr 


Asn 


Asn 








85 










90 










95 




Pro Asn 


Gin 


Leu 


Leu 


Leu 


Val 


Ser 


Pro 


Gly 


Ala 


Leu 


Val 


Val 


Cys 


Gly 






100 










105 










110 






Ser Val 


His 


Gin 


Gly 


Val 


Cys 


Glu 


Gin 


Arg 


Arg 


Leu 


Gly 


Gin 


_ieu 


G^u 




115 






120 










125 








Gin Leu 


Leu 


Leu 


Arg 


Pro 


Glu 


Arg 


Pro 


Gly 


Asp 


Thr 


Gin 


Tyr 


Val 


Ala 


130 










135 










140 










Ala Asn 


Asp 


Pro 


Ala 


Val 


Ser 


Thr 


Val 


Gly 


Leu 


Val 


Ala 


Gin 


Gly 


Leu 


145 






150 










155 










160 


Ala Gly 


Glu 


Pro 


Leu 


Leu 


Phe 


Val 


Gly 


Arg 


Gly 


Tyr 


Thr 


Ser 


Arg 


Gly 








165 










170 










175 




Val Gly 


Gly 


Gly 


He 


Pro 


Pro 


He 


Thr 


Thr 


Arg 


Ala 


Leu 


Trp 


Pro 


Pro 






180 










185 










190 






Asp Pro 


Gin 


Ala 


Ala 


Phe 


Ser 


Tyr 


Glu 


Glu 


Thr 


Ala 


Lys 


Leu 


Ala 


Val 




195 










200 










205 








Gly Arg 


Leu 


Ser 


Glu 


Tyr 


Ser 


His 


His 


Phe 


Val 


Ser 


Ala 


Phe 


Ala 


Arg 


210 










215 










220 












Ser 


Ala 


Tyr 


Phe 




Phe 


Leu 


Arg 


Arg 




Leu 


Gin 


Ala 


Gin 


225 








230 










235 










240 


Ser Arg 


Ala 


Phe 




Ala 


Tyr 


Val 


Ser 


Arg 


Val 


Cys 


Leu 


Arg 


Asp 


Gin 








245 










250 










255 




His Tyr 


Tyr 


Ser 


Tyr 


Val 


Glu 


Leu 


Pro 


Leu 


Ala 


Cys 


Glu 


Gly 


Gly 








260 










265 










270 






Tyr Gly 


Leu 


He 


Gin 


Ala 


Ala 


Ala 


Val 


Ala 


Thr 


Ser 


Arg 


Glu 


Val 


Ala 




275 










280 










285 








His Gly 


Glu 


Val 


Leu 


Phe 


Ala 


Ala 


Phe 


Ser 


Ser 


Ala 


Ala 


Pro 


Pro 


Thr 


290 










295 










300 










Val Gly 


Arg 


Pro 


Pro 


Ser 


Ala 


Ala 


Ala 


Gly 


Ala 


Ser 


Gly 


Ala 


Ser 


Ala 


305 








310 










315 










320 


Leu Cys 


Ala 


Phe 


Pro 


Leu 


Asp 


Glu 


Val 


Asp 


Arg 


Leu 


Ala 


Asn 


Arg 


Thr 








325 








330 










335 




Arg Asp 


Ala 


Cys 


Tyr 


Thr 


Arg 


Glu 


Gly 


Arg 


Ala 


Glu 


Asp 


Gly 


Thr 


Glu 






340 










345 










350 






Val Ala 


Tyr 


He 


Glu 


Tyr 


Asp 


Val 


Asn 


Ser 


Asp 


Cys 


Ala 


Gin 


Leu 


Pro 
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355 




360 


Val 


Asp 


Thr 


Leu Asp 


Ala Tyr Pro 




370 






375 


Pro 


Met 


Ala 


Ser Arg 


Val Pro Leu 


385 








390 


Pro Gly 


He 


Gin Leu 


Thr Ala Val 








405 




Thr 


He 


Ala 


Phe Leu 


Gly Asp Ser 








420 




Leu Gly 


Pro 


Gly Ser 


Asp Gly His 






435 




440 


Gin Gly 


Ser 


Ala Val 


Ser Arg Asp 




450 






455 


His 


Leu 


Tyr 


Val Met 


Thr Gin Ser 


465 






470 


Ser 


Cys 


Ala 


Gin His 


Leu Asp Cys 








485 




Pro 


Tyr 


Cys 


Gly Trp 


Cys Val Leu 








500 




Glu 


Cys 


Ser 


Arg Gly 


Gin Gly Pro 






515 




520 


Pro 


Glu 


Leu 


Gly Cys 


Leu Gin Val 




530 






535 


Ser 


Arg 


Glu 


Glu Thr 


Arg Glu Val 


545 








550 


Pro 


Leu 


Trp 


Pro Gly 


Glu Ser Tyr 








565 




Ser 


Pro 


Ala 


Leu Leu 


Thr Gly Ser 








580 




Pro 


Ser 


Glu 


Ala Pro 


Val Leu Pro 






595 




600 


Ser 


Val 


Glu 


Leu Arg 


Phe Gly Ala 




610 






615 


Ser 


Phe 


Tyr 


Asp Cys 


Val Ala Val 


625 








630 


Cys 


Gin 


Ala 


Cys Val 


Ser Ser Arg 








645 




Gin 


His 


Leu 


Cys Thr 


His Lys Ala 








660 




Ala 


Ser 


His 


Gin Val 


Met Glu Thr 






675 




680 


Pro 


Pro 


Ser 


Ser Ser 


Arg Pro Ala 




690 






695 


Gly 


Ser 


Gly 


Ser Trp 


Lys Arg Arg 


705 








710 


Val 


Trp 


Arg 


Ala Phe 


Arg Ala Pro 








725 





365 



Cys Gly 


Ser 


Asp 


His 


Thr 


Pro 


Ser 






380 










Glu Ala 


Thr 


Pro 


He 


Leu 


Glu 


Trp 




395 










400 


Ala Val 


Thr 


Met 


Glu 


Asp 


Gly 


His 


410 










415 




Gin Gly 


Gin 


Leu 


His 


Arg 


Val 


Tyr 


425 








430 






Pro Tyr 


Ser 


Thr 


Gin 


Ser 


He 


Gin 








445 








Leu Thr 


Phe 


Asp 


Gly 


Thr 


Phe 


Glu 






460 








Thr Leu 


Leu 


Lys 


Val 


Pro 


Val 


Ala 




475 










480 


Ala Ser 


Cys 


Leu 


Ala 


His 


Arg 


Asp 


490 










495 




Leu Gly 


Arg 


Cys 


Ser 


Arg 


Arg 


Ser 


505 








510 






Glu Gin 


Trp 


Leu 


Trp 


Ser 


Phe 


Gin 








525 








Ala Ala 


Met 


Ser 


Pro 


Ala 


Asn 


He 






540 










Phe Leu 


Ser 


Val 


Pro 


Asp 


Leu 


Pro 




555 










560 


Ser Cys 


His 


Phe 


Gly 


Glu 


His 


Gin 


570 










575 




Gly Val 


Met 


Cys 


Pro 


Ser 


Pro 


Asp 


585 








590 






Arg Gly 


Ala 


Asp 


Tyr 


Val 


Ser 


Val 








605 








Val Val 


He 


Ala 


Lys 


Thr 


Ser 


Leu 






620 










Thr Glu 


Leu 


Arg 


Pro 


Ser 


Ala 


Gin 




635 








640 


Trp Gly 


Cys 


Asn 


Trp 


Cys 


Val 


Trp 


650 










655 




Ser Cys 


Asp 


Ala 


Gly 


Pro 


Met 


Val 
















Gin Gin 


Ser 


Leu 


Arc 


Ala 


Leu 


Pro 








685 








Ser Thr 


Thr 


Ser 


Met 


Thr 


Pro 


Pro 






700 










Pro Trp 


Gly 


Gin 


Ala 


Pro 


Ala 


Pro 




715 










720 


Arg 
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